(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
11 October 2001 (11.10.2001) 




PCT 




(10) International Publication Number 

WO 01/75171 A2



(51) International Patent Classification⁷: C12Q 1/68,
G01N 33/543, 33/574 // C12N 5/08

(21) International Application Number: PCT/US01/10631

(22) International Filing Date: 2 April 2001 (02.04.2001)



(25) Filing Language: English

(26) Publication Language: English



(30) Priority Data:

60/194,241    3 April 2000 (03.04.2000)        US
60/219,862    20 July 2000 (20.07.2000)        US
60/221,300    27 July 2000 (27.07.2000)        US
60/256,592    18 December 2000 (18.12.2000)    US



(71) Applicant (for all designated States except US): CORIXA 
CORPORATION [US/US]; 1124 Columbia Street, Suite 
200, Seattle, WA 98104 (US). 

(72) Inventors; and

(75) Inventors/Applicants (for US only): HOUGHTON, Raymond, L.
[US/US]; 2636 242nd Place S.E., Bothell, WA 98021 (US).
DILLON, Davin, C. [US/US]; 18112 N.W. Montreux Drive,
Issaquah, WA 98027 (US). MOLESH, David, Alan [US/US];
12385 N.E. Klabo Road, Kingston, WA 98346 (US). XU,
Jiangchun [US/US]; 15805 S.E. 43rd Place, Bellevue, WA
98006 (US). ZEHENTNER, Barbara [DE/US]; 4420 Bluff Lane,
Bainbridge Island, WA 98110 (US). PERSING, David, H.
[US/US]; 22401 N.E. 25th Way, Redmond, WA 98053 (US).

(74) Agents: POTTER, Jane, E., R. et al.; Seed Intellectual
Property Law Group PLLC, Suite 6300, 701 Fifth Avenue,
Seattle, WA 98104-7092 (US).

(81) Designated States (national): AE, AG, AL, AM, AT, AU,
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU,
CZ, DE, DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM,
HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK,
LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX,
MZ, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL,
TJ, TM, TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, ZW.

(84) Designated States (regional): ARIPO patent (GH, GM,
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE,
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ, CF,
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG).




(54) Title: METHODS, COMPOSITIONS AND KITS FOR THE DETECTION AND MONITORING OF BREAST CANCER

[Figure: mRNA expression profiles for B311D, B533S and B726P. Bar chart of
copies of B726P, B533S and B311D measured in breast tumor (B.T.) samples and
in a panel of other tissue samples.]



(57) Abstract: Compositions and methods for the therapy and diagnosis of cancer, such as breast cancer, are disclosed. Compositions
may comprise one or more breast tumor proteins, immunogenic portions thereof, or polynucleotides that encode such portions.
Alternatively, a therapeutic composition may comprise an antigen presenting cell that expresses a breast tumor protein, or a T cell
that is specific for cells expressing such a protein. Such compositions may be used, for example, for the prevention and treatment of
diseases such as breast cancer. Diagnostic methods based on detecting a breast tumor protein, or mRNA encoding such a protein, in
a sample are also provided.

METHODS, COMPOSITIONS AND KITS
FOR THE DETECTION AND MONITORING OF BREAST CANCER

TECHNICAL FIELD OF THE INVENTION 

The present invention relates generally to the field of cancer diagnostics.
More specifically, the present invention relates to methods, compositions and kits for
the detection of cancer that employ oligonucleotide hybridization and/or amplification
to simultaneously detect two or more tissue-specific polynucleotides in a biological
sample suspected of containing cancer cells.

BACKGROUND OF THE INVENTION 

Cancer remains a significant health problem throughout the world. The
failure of conventional cancer treatment regimens can commonly be attributed, in part,
to delayed disease diagnosis. Although significant advances have been made in the area 
of cancer diagnosis, there still remains a need for improved detection methodologies 
that permit early, reliable and sensitive determination of the presence of cancer cells. 

Breast cancer is second only to lung cancer in mortality among women
in the U.S., affecting more than 180,000 women each year and resulting in
approximately 40,000-50,000 deaths annually. For women in North America, the
lifetime odds of getting breast cancer are one in eight.

Management of the disease currently relies on a combination of early
diagnosis (through routine breast screening procedures) and aggressive treatment,
which may include one or more of a variety of treatments such as surgery, radiotherapy,
chemotherapy and hormone therapy. The course of treatment for a particular breast
cancer is often selected based on a variety of prognostic parameters, including analysis
of specific tumor markers. See, e.g., Porter-Jordan et al., Breast Cancer 8:73-100
(1994). The use of established markers often leads, however, to a result that is difficult
to interpret; and the high mortality observed in breast cancer patients indicates that
improvements are needed in the diagnosis of the disease.

The recent introduction of immunotherapeutic approaches to breast
cancer treatment that are targeted to Her2/neu has provided significant motivation to
identify additional breast cancer-specific genes as targets for therapeutic antibodies and
T-cell vaccines as well as for diagnosis of the disease. To this end, mammaglobin has
been identified as one of the most breast-specific genes discovered to date, being
expressed in approximately 70-80% of breast cancers. Because of its highly
tissue-specific distribution, detection of mammaglobin gene expression has been used to
identify micrometastatic lesions in lymph node tissues and, more recently, to detect
circulating breast cancer cells in peripheral blood of breast cancer patients with known
primary and metastatic lesions.

Mammaglobin is a homologue of a rabbit uteroglobin and the rat steroid
binding protein subunit C3 and is a low molecular weight protein that is highly
glycosylated. Watson et al., Cancer Res. 56:860-5 (1996); Watson et al., Cancer Res.
59:3028-3031 (1999); Watson et al., Oncogene 16:817-24 (1998). In contrast to its
homologs, mammaglobin has been reported to be breast specific and overexpression has
been described in breast tumor biopsies (23%), primary and metastatic breast tumors
(~75%) with reports of the detection of mammaglobin mRNA expression in 91% of
lymph nodes from metastatic breast cancer patients. Leygue et al., J. Pathol. 189:28-33
(1999) and Min et al., Cancer Res. 58:4581-4584 (1998).

Since mammaglobin gene expression is not a universal feature of breast 
cancer, the detection of this gene alone may be insufficient to permit the reliable 
detection of all breast cancers. Accordingly, what is needed in the art is a methodology
that employs the detection of two or more breast cancer specific genes in order to 
improve the sensitivity and reliability of detection of micrometastases, for example, in 
lymph nodes and bone marrow and/or for recognition of anchorage-independent cells in 
the peripheral circulation. 

The present invention achieves these and other related objectives by
providing methods that are useful for the identification of tissue-specific
polynucleotides, in particular tumor-specific polynucleotides, as well as methods,
compositions and kits for the detection and monitoring of cancer cells in a patient
afflicted with the disease.

SUMMARY OF THE INVENTION

By certain embodiments, the present invention provides
methods for identifying one or more tissue-specific polynucleotides which methods
comprise the steps of: (a) performing a genetic subtraction to identify a pool of
polynucleotides from a tissue of interest; (b) performing a DNA microarray analysis to
identify a first subset of said pool of polynucleotides of interest wherein each member
polynucleotide of said first subset is at least two-fold over-expressed in said tissue of
interest as compared to a control tissue; and (c) performing a quantitative polymerase
chain reaction analysis on polynucleotides within said first subset to identify a second
subset of polynucleotides that are at least two-fold over-expressed as compared to the
control tissue. Preferred genetic subtractions are selected from the group consisting of
differential display and cDNA subtraction and are described in further detail herein
below.
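
By way of illustration only, the successive two-fold filters of steps (b) and
(c) may be sketched in Python as follows. The gene names, the expression
values and the helper names (fold_change, select_overexpressed) are
hypothetical and are not taken from this disclosure; the sketch assumes that
each analysis yields one expression level for the tissue of interest and one
for the control tissue.

```python
def fold_change(tissue_level, control_level):
    """Ratio of expression in the tissue of interest to the control tissue."""
    if control_level <= 0:
        # No detectable control expression: treat any signal as unbounded.
        return float("inf") if tissue_level > 0 else 0.0
    return tissue_level / control_level


def select_overexpressed(candidates, levels, min_fold=2.0):
    """Keep candidates whose tissue/control ratio is at least min_fold.

    levels maps a candidate name to (tissue_level, control_level).
    Candidates without a measurement are dropped.
    """
    return [
        name for name in candidates
        if name in levels and fold_change(*levels[name]) >= min_fold
    ]


# Hypothetical pool from step (a), with made-up expression values.
pool = ["geneA", "geneB", "geneC", "geneD"]
microarray = {"geneA": (8.0, 2.0), "geneB": (3.0, 2.0),
              "geneC": (5.0, 1.0), "geneD": (9.0, 3.0)}
qpcr = {"geneA": (400.0, 100.0), "geneC": (150.0, 100.0),
        "geneD": (90.0, 30.0)}

first_subset = select_overexpressed(pool, microarray)      # step (b)
second_subset = select_overexpressed(first_subset, qpcr)   # step (c)
```

In this made-up example geneB fails the microarray filter and geneC fails the
quantitative PCR filter, so only geneA and geneD remain in the second subset.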

Alternate embodiments of the present invention provide methods of
identifying a subset of polynucleotides showing concordant and/or complementary
tissue-specific expression profiles in a tissue of interest. Such methods comprise the
steps of: (a) performing an expression analysis selected from the group consisting of
DNA microarray and quantitative PCR to identify a first polynucleotide that is at least
two-fold over-expressed in a tissue of interest as compared to a control tissue; and (b)
performing an expression analysis selected from the group consisting of DNA
microarray and quantitative PCR to identify a first polynucleotide that is at least
two-fold over-expressed in a tissue of interest as compared to a control tissue.

Further embodiments of the present invention provide methods for
detecting the presence of a cancer cell in a patient. Such methods comprise the steps of:
(a) obtaining a biological sample from the patient; (b) contacting the biological sample
with a first oligonucleotide pair wherein the members of the first oligonucleotide pair
hybridize, under moderately stringent conditions, to a first polynucleotide and the
complement thereof, respectively; (c) contacting the biological sample with a second
oligonucleotide pair wherein the members of the second oligonucleotide pair hybridize,
under moderately stringent conditions, to a second polynucleotide and the complement
thereof, respectively and wherein the first polynucleotide is unrelated in nucleotide
sequence to the second polynucleotide; (d) amplifying the first polynucleotide and the
second polynucleotide; and (e) detecting the amplified first polynucleotide and the
amplified second polynucleotide; wherein the presence of the amplified first
polynucleotide or amplified second polynucleotide indicates the presence of a cancer
cell in the patient.
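
The decision rule of step (e), under which detection of either amplified
polynucleotide suffices, may be sketched as follows. This is a minimal
illustration only; the function names and the detection results shown are
hypothetical, and the sketch assumes that detection of each amplified
polynucleotide has already been reduced to a yes/no result.

```python
def cancer_cell_indicated(amplified):
    """Return True if at least one marker's amplification product was detected.

    amplified maps a marker name to True (product detected) or False.
    """
    return any(amplified.values())


def detected_markers(amplified):
    """List the markers whose amplification product was detected."""
    return sorted(name for name, seen in amplified.items() if seen)


# Made-up results for two markers that are unrelated in sequence.
sample_1 = {"mammaglobin": False, "B726P": True}
sample_2 = {"mammaglobin": False, "B726P": False}

result_1 = cancer_cell_indicated(sample_1)
result_2 = cancer_cell_indicated(sample_2)
```

Because the rule is a logical OR across markers, a sample that lacks
expression of one marker can still be scored positive through the other.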

By some embodiments, detection of the amplified first and/or second
polynucleotides may be preceded by a fractionation step such as, for example, gel
electrophoresis. Alternatively or additionally, detection of the amplified first and/or
second polynucleotides may be achieved by hybridization of a labeled oligonucleotide
probe that hybridizes specifically, under moderately stringent conditions, to the first or
second polynucleotide. Oligonucleotide labeling may be achieved by incorporating a
radiolabeled nucleotide or by incorporating a fluorescent label.

In certain preferred embodiments, cells of a specific tissue type may be
enriched from the biological sample prior to the steps of detection. Enrichment may be
achieved by a methodology selected from the group consisting of cell capture and cell
depletion. Exemplary cell capture methods include immunocapture and comprise the
steps of: (a) adsorbing an antibody to a tissue-specific cell surface marker onto cells in
said biological sample; and (b) separating the antibody-adsorbed tissue-specific cells
from the remainder of the biological sample. Exemplary cell depletion may be achieved by
cross-linking red cells and white cells followed by a subsequent fractionation step to
remove the cross-linked cells.

Alternative embodiments of the present invention provide methods for
determining the presence or absence of a cancer in a patient, comprising the steps of: (a)
contacting a biological sample obtained from the patient with an oligonucleotide that
hybridizes to a polynucleotide that encodes a breast tumor protein; (b) detecting in the
sample a level of a polynucleotide (such as, for example, mRNA) that hybridizes to the
oligonucleotide; and (c) comparing the level of polynucleotide that hybridizes to the
oligonucleotide with a predetermined cut-off value, and therefrom determining the
presence or absence of a cancer in the patient. Within certain embodiments, the amount
of mRNA is detected via polymerase chain reaction using, for example, at least one
oligonucleotide primer that hybridizes to a polynucleotide encoding a polypeptide as
recited above, or a complement of such a polynucleotide. Within other embodiments,
the amount of mRNA is detected using a hybridization technique, employing an
oligonucleotide probe that hybridizes to a polynucleotide that encodes a polypeptide as
recited above, or a complement of such a polynucleotide.
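
The comparison of step (c) may be sketched as follows. The sketch assumes one
possible way of fixing the cut-off, namely the mean of negative normal tissues
plus 3 standard deviations, which is the basis stated for the cut-offs in the
description of Figures 3A and 3B. The use of the sample standard deviation,
the function names and the numerical values are assumptions made for
illustration only.

```python
import statistics


def cutoff_from_normals(normal_levels, n_sd=3.0):
    """Cut-off = mean of negative normal samples + n_sd standard deviations.

    Uses the sample standard deviation (statistics.stdev); whether a sample
    or population estimate is intended is an assumption of this sketch.
    """
    return statistics.mean(normal_levels) + n_sd * statistics.stdev(normal_levels)


def is_positive(sample_level, cutoff):
    """A sample is scored positive when its level exceeds the cut-off."""
    return sample_level > cutoff


# Made-up marker levels in negative normal tissues.
normals = [1.0, 2.0, 3.0]
cutoff = cutoff_from_normals(normals)

positive = is_positive(6.2, cutoff)
negative = is_positive(4.9, cutoff)
```

With these made-up values the mean is 2.0 and the sample standard deviation is
1.0, so the cut-off is 5.0; a level of 6.2 is scored positive and a level of
4.9 is not.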

In related aspects, methods are provided for monitoring the progression
of a cancer in a patient, comprising the steps of: (a) contacting a biological sample
obtained from a patient with an oligonucleotide that hybridizes to a polynucleotide that
encodes a breast tumor protein; (b) detecting in the sample an amount of a
polynucleotide that hybridizes to the oligonucleotide; (c) repeating steps (a) and (b)
using a biological sample obtained from the patient at a subsequent point in time; and
(d) comparing the amount of polynucleotide detected in step (c) with the amount
detected in step (b) and therefrom monitoring the progression of the cancer in the
patient.
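
The comparison of step (d) may be sketched as follows. The sketch only
classifies the direction of change between two time points; the tolerance
band, the function name and the amounts shown are hypothetical, and how a
given change is interpreted clinically is outside the scope of the sketch.

```python
def compare_time_points(earlier, later, tolerance=0.10):
    """Classify the change in detected amount between two time points.

    tolerance is a hypothetical relative band (10% by default) within which
    the two amounts are treated as unchanged.
    """
    if earlier <= 0:
        return "increased" if later > 0 else "unchanged"
    change = (later - earlier) / earlier
    if change > tolerance:
        return "increased"
    if change < -tolerance:
        return "decreased"
    return "unchanged"


# Made-up amounts detected in step (b) and in the repeat of step (c).
trend_up = compare_time_points(10.0, 25.0)
trend_down = compare_time_points(10.0, 4.0)
trend_flat = compare_time_points(10.0, 10.5)
```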

Certain embodiments of the present invention provide that the step of
amplifying said first polynucleotide and said second polynucleotide is achieved by the
polymerase chain reaction (PCR).

Within certain embodiments, the cancer cell to be detected may be
selected from the group consisting of prostate cancer, breast cancer, colon cancer,
ovarian cancer, lung cancer, head & neck cancer, lymphoma, leukemia, melanoma, liver
cancer, gastric cancer, kidney cancer, bladder cancer, pancreatic cancer and endometrial
cancer. Still further embodiments of the present invention provide that the biological
sample is selected from the group consisting of blood, a lymph node and bone marrow.
The lymph node may be a sentinel lymph node.

Within specific embodiments of the present invention it is provided that the
first polynucleotide is selected from the group consisting of mammaglobin, lipophilin
B, GABAπ (B899P), B726P, B511S, B533S, B305D and B311D. Other embodiments
provide that the second polynucleotide is selected from the group consisting of
mammaglobin, lipophilin B, GABAπ (B899P), B726P, B511S, B533S, B305D and
B311D.

Alternate embodiments of the present invention provide methods for
detecting the presence or absence of a cancer in a patient, comprising the steps of: (a)
contacting a biological sample obtained from a patient with a first oligonucleotide that
hybridizes to a polynucleotide selected from the group consisting of mammaglobin and
lipophilin B; (b) contacting the biological sample with a second oligonucleotide that
hybridizes to a polynucleotide sequence selected from the group consisting of GABAπ
(B899P), B726P, B511S, B533S, B305D and B311D; (c) detecting in the sample an
amount of a polynucleotide that hybridizes to at least one of the oligonucleotides; and
(d) comparing the amount of polynucleotide that hybridizes to the oligonucleotide to a
predetermined cut-off value, and therefrom determining the presence or absence of a
cancer in the patient.

According to certain embodiments, oligonucleotides may be selected
from those disclosed herein such as those presented in SEQ ID NOs:33-72. By other
embodiments, the amount of polynucleotide that hybridizes to the oligonucleotide is
determined using a polymerase chain reaction. Alternatively, the amount of
polynucleotide that hybridizes to the oligonucleotide may be determined using a
hybridization assay.

Still other embodiments of the present invention provide methods for
determining the presence or absence of a cancer cell in a patient, comprising the steps
of: (a) contacting a biological sample obtained from a patient with a first
oligonucleotide that hybridizes to a polynucleotide selected from the group consisting
of a polynucleotide depicted in SEQ ID NO:73 and SEQ ID NO:74 or complement
thereof; (b) contacting the biological sample with a second oligonucleotide that
hybridizes to a polynucleotide depicted in SEQ ID NO:75 or complement thereof; (c)
contacting the biological sample with a third oligonucleotide that hybridizes to a
polynucleotide selected from the group consisting of a polynucleotide depicted in SEQ
ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7 or
complement thereof; (d) contacting the biological sample with a fourth oligonucleotide
that hybridizes to a polynucleotide selected from the group consisting of a
polynucleotide depicted in SEQ ID NO:11 or complement thereof; (e) contacting the
biological sample with a fifth oligonucleotide that hybridizes to a polynucleotide
selected from the group consisting of a polynucleotide depicted in SEQ ID NO:13, 15
and 17 or complement thereof; (f) contacting the biological sample with a sixth
oligonucleotide that hybridizes to a polynucleotide selected from the group consisting
of a polynucleotide depicted in SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ
ID NO:22, SEQ ID NO:23 and SEQ ID NO:24 or complement thereof; (g) contacting
the biological sample with a seventh oligonucleotide that hybridizes to a polynucleotide
depicted in SEQ ID NO:30 or complement thereof; (h) contacting the biological sample
with an eighth oligonucleotide that hybridizes to a polynucleotide depicted in SEQ ID
NO:32 or complement thereof; (i) contacting the biological sample with a ninth
oligonucleotide that hybridizes to a polynucleotide depicted in SEQ ID NO:76 or
complement thereof; (j) detecting in the sample a hybridized oligonucleotide of any one
of steps (a) through (i); and (k) comparing the amount of polynucleotide that hybridizes
to the oligonucleotide to a predetermined cut-off value, wherein the presence of a
hybridized oligonucleotide in any one of steps (a) through (i) in excess of the
pre-determined cut-off value indicates the presence of a cancer cell in the biological sample
of said patient.

Other related embodiments of the present invention provide methods for
determining the presence or absence of a cancer cell in a patient, comprising the steps
of: (a) contacting a biological sample obtained from a patient with a first
oligonucleotide and a second oligonucleotide wherein said first and second
oligonucleotides hybridize under moderately stringent conditions to a first and a second
polynucleotide selected from the group consisting of SEQ ID
NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5,
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:11, SEQ ID NO:13, SEQ ID NO:15, SEQ
ID NO:17, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID
NO:23, SEQ ID NO:24, SEQ ID NO:30, SEQ ID NO:32, and SEQ ID NO:76 and
wherein said first polynucleotide is unrelated structurally to said second polynucleotide;
(b) detecting in the sample said first and said second hybridized oligonucleotides; and
(c) comparing the amount of polynucleotide that hybridizes to the oligonucleotide to a
predetermined cut-off value, wherein the presence of a hybridized first oligonucleotide
or a hybridized second oligonucleotide in excess of the pre-determined cut-off value
indicates the presence of a cancer cell in the biological sample of said patient.

Other related embodiments of the present invention provide methods for
determining the presence or absence of a cancer cell in a patient, comprising the steps
of: (a) contacting a biological sample obtained from a patient with a first
oligonucleotide and a second oligonucleotide wherein said first and second
oligonucleotides hybridize under moderately stringent conditions to a first and a second
polynucleotide, wherein said first and second polynucleotides are both tissue-specific
polynucleotides of the cancer to be detected and
wherein said first polynucleotide is unrelated structurally to said second polynucleotide;
(b) detecting in the sample said first and said second hybridized oligonucleotides; and
(c) comparing the amount of polynucleotide that hybridizes to the oligonucleotide to a
predetermined cut-off value, wherein the presence of a hybridized first oligonucleotide
or a hybridized second oligonucleotide in excess of the pre-determined cut-off value
indicates the presence of a cancer cell in the biological sample of said patient.

In other related aspects, the present invention further provides
compositions useful in the methods disclosed herein. Exemplary compositions
comprise two or more oligonucleotide primer pairs each one of which specifically
hybridizes to a distinct polynucleotide. Exemplary oligonucleotide primers suitable for
compositions of the present invention are disclosed herein by SEQ ID NOs: 33-71.
Exemplary polynucleotides suitable for compositions of the present invention are
disclosed in SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:1, SEQ ID
NO:3, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:11, SEQ ID NO:13,
SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ
ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:30, SEQ ID NO:32, and SEQ
ID NO:76.

The present invention also provides kits that are suitable for performing
the detection methods of the present invention. Exemplary kits comprise
oligonucleotide primer pairs each one of which specifically hybridizes to a distinct
polynucleotide. Within certain embodiments, kits according to the present invention
may also comprise a nucleic acid polymerase and suitable buffer. Exemplary
oligonucleotide primers suitable for kits of the present invention are disclosed herein by
SEQ ID NOs: 33-71. Exemplary polynucleotides suitable for kits of the present
invention are disclosed in SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID
NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:11,
SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:20, SEQ
ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:30, SEQ ID
NO:32, and SEQ ID NO:76.

These and other aspects of the present invention will become apparent
upon reference to the following detailed description and attached drawings. All
references disclosed herein are hereby incorporated by reference in their entirety as if
each was incorporated individually.

BRIEF DESCRIPTION OF THE DRAWINGS AND SEQUENCE IDENTIFIERS 
Figure 1 shows the mRNA expression profiles for B311D, B533S and
B726P as determined using quantitative PCR (Taqman™). Abbreviations: B.T.: Breast
tumor; B.M.: Bone marrow; B.R.: Breast reduction.

Figure 2 shows the relationship of B533S expression to pathological
stage of tumor. Tissues from normal breast (8), benign breast disorders (3), and breast
tumors stage I (5), stage II (6), stage III (7), stage IV (3) and metastases (1 lymph node
and 3 pleural effusions) were tested in real-time PCR. The data is expressed as the mean
copies/ng β-actin for each group tested and the line is the calculated trend line.

Figures 3A and 3B show the gene complementation of B305D C-form,
B726P, GABAπ and mammaglobin in metastases and primary tumors, respectively.
The cut-off for each of the genes was 6.57, 1.65, 4.58 and 3.56 copies/ng β-actin,
respectively, based on the mean of the negative normal tissues plus 3 standard deviations.

Figure 4 shows the full-length cDNA sequence for mammaglobin.

Figure 5 shows the determined cDNA sequence of the open reading
frame encoding a mammaglobin recombinant polypeptide expressed in E. coli.

Figure 6 shows the full-length cDNA sequence for GABAπ.

Figure 7 shows the mRNA expression levels for mammaglobin,
GABAπ, B305D (C form) and B726P in breast tumor and normal samples determined
using real-time PCR and the SYBR detection system. Abbreviations: BT: Breast
tumor; BR: Breast reduction; A. PBMC: Activated peripheral blood mononuclear cells;
R. PBMC: resting PBMC; T. Gland: Thyroid gland; S. Cord: Spinal Cord; A. Gland:
Adrenal gland; B. Marrow: Bone marrow; S. Muscle: Skeletal muscle.

Figure 8 is a bar graph showing a comparison between the Lipophilin B
alone and the Lipophilin B-B899P-B305D-C-B726 multiplex assays tested on a panel of
breast tumor samples. Abbreviations: BT: Breast tumor; BR: Breast reduction; SCID:
severe combined immunodeficiency.

Figure 9 is a gel showing the unique band length of four amplification
products of tumor genes of interest (mammaglobin, B305D, B899P, B726P) tested in a
multiplex Real-time PCR assay.

Figure 10 shows a comparison of a multiplex assay using intron-exon
border spanning primers (bottom panel) and those using non-optimized primers (top
panel), to detect breast cancer cells in a panel of lymph node tissues.

SEQ ID NO: 1 is the determined cDNA sequence for a first splice
variant of B305D isoform A.
SEQ ID NO: 2 is the amino acid sequence encoded by the sequence of
SEQ ID NO: 1.
SEQ ID NO: 3 is the determined cDNA sequence for a second splice
variant of B305D isoform A.
SEQ ID NO: 4 is the amino acid sequence encoded by the sequence of
SEQ ID NO: 3.
SEQ ID NO: 5-7 are the determined cDNA sequences for three splice
variants of B305D isoform C.
SEQ ID NO: 8-10 are the amino acid sequences encoded by the
sequence of SEQ ID NO: 5-7, respectively.
SEQ ID NO: 11 is the determined cDNA sequence for B311D.
SEQ ID NO: 12 is the amino acid sequence encoded by the sequence of
SEQ ID NO: 11.
SEQ ID NO: 13 is the determined cDNA sequence of a first splice
variant of B726P.
SEQ ID NO: 14 is the amino acid sequence encoded by the sequence of
SEQ ID NO: 13.
SEQ ID NO: 15 is the determined cDNA sequence of a second splice
variant of B726P.
SEQ ID NO: 16 is the amino acid sequence encoded by the sequence of
SEQ ID NO: 15.
SEQ ID NO: 17 is the determined cDNA sequence of a third splice
variant of B726P.
SEQ ID NO: 18 is the amino acid sequence encoded by the sequence of
SEQ ID NO: 17.
SEQ ID NO: 19-24 are the determined cDNA sequences of further splice
variants of B726P.
SEQ ID NO: 25-29 are the amino acid sequences encoded by SEQ ID
NO: 19-24, respectively.
SEQ ID NO: 30 is the determined cDNA sequence for B511S.
SEQ ID NO: 31 is the amino acid sequence encoded by SEQ ID NO: 30.
SEQ ID NO: 32 is the determined cDNA sequence for B533S.
SEQ ID NO:33 is the DNA sequence of Lipophilin B forward primer.
SEQ ID NO:34 is the DNA sequence of Lipophilin B reverse primer.
SEQ ID NO:35 is the DNA sequence of Lipophilin B probe.
SEQ ID NO:36 is the DNA sequence of GABA (B899P) forward primer.
SEQ ID NO:37 is the DNA sequence of GABA (B899P) reverse primer.
SEQ ID NO:38 is the DNA sequence of GABA (B899P) probe.
SEQ ID NO:39 is the DNA sequence of B305D (C form) forward primer.
SEQ ID NO:40 is the DNA sequence of B305D (C form) reverse primer.
SEQ ID NO:41 is the DNA sequence of B305D (C form) probe.
SEQ ID NO:42 is the DNA sequence of B726P forward primer.
SEQ ID NO:43 is the DNA sequence of B726P reverse primer.
SEQ ID NO:44 is the DNA sequence of B726P probe.
SEQ ID NO:45 is the DNA sequence of Actin forward primer.
SEQ ID NO:46 is the DNA sequence of Actin reverse primer.
SEQ ID NO:47 is the DNA sequence of Actin probe.
SEQ ID NO:48 is the DNA sequence of Mammaglobin forward primer.
SEQ ID NO:49 is the DNA sequence of Mammaglobin reverse primer.
SEQ ID NO:50 is the DNA sequence of Mammaglobin probe.
SEQ ID NO:51 is the DNA sequence of a second GABA (B899P)
reverse primer.
SEQ ID NO:52 is the DNA sequence of a second B726P forward primer.
SEQ ID NO:53 is the DNA sequence of a GABA B899P-INT forward primer.
SEQ ID NO:54 is the DNA sequence of a GABA B899P-INT reverse primer.
SEQ ID NO:55 is the DNA sequence of a GABA B899P-INT Taqman probe.
SEQ ID NO:56 is the DNA sequence of a B305D-INT forward primer.
SEQ ID NO:57 is the DNA sequence of a B305D-INT reverse primer.
SEQ ID NO:58 is the DNA sequence of a B305D-INT Taqman probe.
SEQ ID NO:59 is the DNA sequence of a B726-INT forward primer.
SEQ ID NO:60 is the DNA sequence of a B726-INT reverse primer.
SEQ ID NO:61 is the DNA sequence of a B726-INT Taqman probe.
SEQ ID NO:62 is the DNA sequence of a GABA B899P Taqman probe.
SEQ ID NO:63 is the DNA sequence of a B311D forward primer.
SEQ ID NO:64 is the DNA sequence of a B311D reverse primer.
SEQ ID NO:65 is the DNA sequence of a B311D Taqman probe.
SEQ ID NO:66 is the DNA sequence of a B533S forward primer.
SEQ ID NO:67 is the DNA sequence of a B533S reverse primer.
SEQ ID NO:68 is the DNA sequence of a B533S Taqman probe.
SEQ ID NO:69 is the DNA sequence of a B511S forward primer.
SEQ ID NO:70 is the DNA sequence of a B511S reverse primer.
SEQ ID NO:71 is the DNA sequence of a B511S Taqman probe.
SEQ ID NO:72 is the DNA sequence of a GABAπ reverse primer.
SEQ ID NO:73 is the full-length cDNA sequence for mammaglobin.
SEQ ID NO:74 is the determined cDNA sequence of the open reading
frame encoding a mammaglobin recombinant polypeptide expressed in E. coli.
SEQ ID NO:75 is the full-length cDNA sequence for GABAπ.
SEQ ID NO:76 is the full-length cDNA sequence for lipophilin B.
SEQ ID NO:77 is the amino acid sequence encoded by the sequence of
SEQ ID NO:76.

DETAILED DESCRIPTION OF THE INVENTION

As noted above, the present invention is directed generally to methods
that are suitable for the identification of tissue-specific polynucleotides as well as to
methods, compositions and kits that are suitable for the diagnosis and monitoring of
cancer. While certain exemplary methods, compositions and kits disclosed herein are
directed to the identification, detection and monitoring of breast cancer, in particular
breast cancer-specific polynucleotides, it will be understood by those of skill in the art
that the present invention is generally applicable to the identification, detection and
monitoring of a wide variety of cancers, and the associated over-expressed
polynucleotides, including, for example, prostate cancer, breast cancer, colon cancer,
ovarian cancer, lung cancer, head & neck cancer, lymphoma, leukemia, melanoma,
liver cancer, gastric cancer, kidney cancer, bladder cancer, pancreatic cancer and
endometrial cancer. Thus, it will be apparent that the present invention is not limited
solely to the identification of breast cancer-specific polynucleotides or to the detection
and monitoring of breast cancer.

Identification of Tissue-specific Polynucleotides 

Certain embodiments of the present invention provide methods, 
compositions and kits for the detection of a cancer cell within a biological sample. 
These methods comprise the step of detecting one or more tissue-specific 
polynucleotide(s) from a patient's biological sample, the over-expression of which 
polynucleotides indicates the presence of a cancer cell within the patient's biological 
sample. Accordingly, the present invention also provides methods that are suitable for 
the identification of tissue-specific polynucleotides. As used herein, the phrases 
"tissue-specific polynucleotides" or "tumor-specific polynucleotides" are meant to 
include all polynucleotides that are at least two-fold over-expressed as compared to one 
or more control tissues. As discussed in further detail herein below, over-expression of 
a given polynucleotide may be assessed, for example, by microarray and/or quantitative 
real-time polymerase chain reaction (Real-time PCR™) methodologies. 

Exemplary methods for detecting tissue-specific polynucleotides may 
comprise the steps of: (a) performing a genetic subtraction to identify a pool of 
polynucleotides from a tissue of interest; (b) performing a DNA microarray analysis to 
identify a first subset of said pool of polynucleotides of interest wherein each member 
polynucleotide of said first subset is at least two-fold over-expressed in said tissue of 
interest as compared to a control tissue; and (c) performing a quantitative polymerase 
chain reaction analysis on polynucleotides within said first subset to identify a second 
subset of polynucleotides that are at least two-fold over-expressed as compared to said 
control tissue. 
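
For illustration only, the two-stage selection of steps (b) and (c) can be sketched in Python. The clone names, the dictionary layout and the expression values are hypothetical and are not taken from this disclosure; the sketch merely assumes that each candidate from the subtraction pool has one expression value for the tissue of interest and one for the control tissue from each assay.

```python
def select_overexpressed(candidates, min_fold=2.0):
    """Apply the microarray screen, then the quantitative PCR screen.

    candidates maps a clone name to a dict holding hypothetical expression
    values for the tissue of interest and the control tissue in each assay.
    Returns (first_subset, second_subset) as lists of clone names.
    """
    # Step (b): keep clones at least min_fold over-expressed by microarray.
    first_subset = [
        name for name, v in candidates.items()
        if v["array_tissue"] / v["array_control"] >= min_fold
    ]
    # Step (c): of those, keep clones also at least min_fold over-expressed
    # by quantitative PCR.
    second_subset = [
        name for name in first_subset
        if candidates[name]["qpcr_tissue"] / candidates[name]["qpcr_control"] >= min_fold
    ]
    return first_subset, second_subset


# Hypothetical pool from a genetic subtraction (step (a)).
pool = {
    "cloneA": {"array_tissue": 8.0, "array_control": 2.0, "qpcr_tissue": 6.0, "qpcr_control": 2.0},
    "cloneB": {"array_tissue": 3.0, "array_control": 2.0, "qpcr_tissue": 9.0, "qpcr_control": 2.0},
    "cloneC": {"array_tissue": 10.0, "array_control": 4.0, "qpcr_tissue": 3.0, "qpcr_control": 2.0},
}
first, second = select_overexpressed(pool)
```

In this toy pool, cloneB is dropped at the microarray stage and cloneC at the quantitative PCR stage, so only cloneA reaches the second subset.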

Polynucleotides Generally 

As used herein, the term "polynucleotide" refers generally to either DNA 
or RNA molecules. Polynucleotides may be naturally occurring as normally found in a 
biological sample such as blood, serum, lymph node, bone marrow, sputum, urine and 
tumor biopsy samples. Alternatively, polynucleotides may be derived synthetically by, 
for example, a nucleic acid polymerization reaction. As will be recognized by the 
skilled artisan, polynucleotides may be single-stranded (coding or antisense) or 
double-stranded, and may be DNA (genomic, cDNA or synthetic) or RNA molecules. RNA 
molecules include HnRNA molecules, which contain introns and correspond to a DNA 
molecule in a one-to-one manner, and mRNA molecules, which do not contain introns. 
Additional coding or non-coding sequences may, but need not, be present within a 
polynucleotide of the present invention, and a polynucleotide may, but need not, be 
linked to other molecules and/or support materials. 

Polynucleotides may comprise a native sequence (i.e., an endogenous 
sequence that encodes a tumor protein, such as a breast tumor protein, or a portion 
thereof) or may comprise a variant, or a biological or antigenic functional equivalent of 
such a sequence. Polynucleotide variants may contain one or more substitutions, 
additions, deletions and/or insertions, as further described below. The term "variants" 
also encompasses homologous genes of xenogenic origin. 

When comparing polynucleotide or polypeptide sequences, two 
sequences are said to be "identical" if the sequence of nucleotides or amino acids in the 
two sequences is the same when aligned for maximum correspondence, as described 
below. Comparisons between two sequences are typically performed by comparing the 
sequences over a comparison window to identify and compare local regions of sequence 
similarity. A "comparison window" as used herein, refers to a segment of at least about 
20 contiguous positions, usually 30 to about 75, 40 to about 50, in which a sequence 
may be compared to a reference sequence of the same number of contiguous positions 
after the two sequences are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using 
the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, 
Inc., Madison, WI), using default parameters. This program embodies several 
alignment schemes described in the following references: Dayhoff, M.O. (1978) A 
model of evolutionary change in proteins - Matrices for detecting distant relationships. 
In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) 
Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology 
vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:11-17; Robinson, 
E.D. (1971) Comb. Theor 11:105; Santou, N. Nes, M. (1987) Mol. Biol. Evol. 4:406-425; 
Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and 
Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. and 
Lipman, D.J. (1983) Proc. Natl. Acad. Sci. USA 80:726-730. 

Alternatively, optimal alignment of sequences for comparison may be 
conducted by the local identity algorithm of Smith and Waterman (1981) Add. APL. 
Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 
Mol. Biol. 48:443, by the search for similarity methods of Pearson and Lipman (1988) 
Proc. Natl. Acad. Sci. USA 85:2444, by computerized implementations of these 
algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 
or by inspection. 

One preferred example of algorithms that are suitable for determining 
percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 
algorithms, which are described in Altschul et al. (1977) Nucl. Acids Res. 25:3389-3402 
and Altschul et al. (1990) J. Mol. Biol. 215:403-410, respectively. BLAST and BLAST 
2.0 can be used, for example with the parameters described herein, to determine percent 
sequence identity for the polynucleotides and polypeptides of the invention. Software 
for performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information. In one illustrative example, cumulative scores can be 
calculated using, for nucleotide sequences, the parameters M (reward score for a pair of 
matching residues; always >0) and N (penalty score for mismatching residues; always 
<0). For amino acid sequences, a scoring matrix can be used to calculate the 
cumulative score. Extension of the word hits in each direction is halted when: the 
cumulative alignment score falls off by the quantity X from its maximum achieved 
value; the cumulative score goes to zero or below, due to the accumulation of one or 
more negative-scoring residue alignments; or the end of either sequence is reached. 
The BLAST algorithm parameters W, T and X determine the sensitivity and speed of 
the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a 
wordlength (W) of 11, and expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff and Henikoff (1989) Proc. Natl. Acad. Sci. USA 89:10915) alignments, 
(B) of 50, expectation (E) of 10, M=5, N=-4 and a comparison of both strands. 
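
The three halting conditions for word-hit extension can be sketched as follows. This is a simplified, ungapped, one-directional illustration and not the BLAST implementation itself; the function name, the default value chosen for X and the test sequences are assumptions made for the example, while M=5 and N=-4 are the nucleotide scores mentioned above.

```python
def extend_right(query, subject, q_start, s_start, match=5, mismatch=-4, x_drop=20):
    """Extend an ungapped hit to the right from (q_start, s_start).

    Returns (best_score, best_length): the maximum cumulative score reached
    and the number of aligned positions at which it was reached.
    """
    score = 0
    best_score = 0
    best_length = 0
    offset = 0
    # Halting condition 3: stop when the end of either sequence is reached.
    while q_start + offset < len(query) and s_start + offset < len(subject):
        same = query[q_start + offset] == subject[s_start + offset]
        score += match if same else mismatch
        offset += 1
        if score > best_score:
            best_score, best_length = score, offset
        # Halting condition 2: cumulative score has gone to zero or below.
        if score <= 0:
            break
        # Halting condition 1: score fell off by X from its maximum value.
        if best_score - score >= x_drop:
            break
    return best_score, best_length


# Four matches followed by mismatches; with X = 8 the extension stops after
# the second mismatch and reports the score of the four matching positions.
result = extend_right("ACGTTTTT", "ACGTAAAA", 0, 0, x_drop=8)
```

A full implementation would also extend to the left and, in later BLAST versions, allow gaps; those parts are omitted here.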

Preferably, the "percentage of sequence identity" is determined by 
comparing two optimally aligned sequences over a window of comparison of at least 20 
positions, wherein the portion of the polynucleotide or polypeptide sequence in the 
comparison window may comprise additions or deletions (i.e., gaps) of 20 percent or 
less, usually 5 to 15 percent, or 10 to 12 percent, as compared to the reference 
sequence (which does not comprise additions or deletions) for optimal alignment of the 
two sequences. The percentage is calculated by determining the number of positions at 
which the identical nucleic acid base or amino acid residue occurs in both sequences to 
yield the number of matched positions, dividing the number of matched positions by the 
total number of positions in the reference sequence (i.e., the window size) and 
multiplying the result by 100 to yield the percentage of sequence identity. 
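
As a minimal sketch of this calculation, assume the two sequences have already been optimally aligned and are supplied as equal-length strings in which "-" marks a gap. The function name and the example sequences are hypothetical.

```python
def percent_identity(aligned_query, aligned_reference):
    """Percentage of sequence identity over an aligned comparison window.

    Matched positions are those where both sequences carry the same residue.
    The divisor is the number of positions in the reference sequence, i.e.
    the window size, so gap characters in the reference are not counted.
    """
    if len(aligned_query) != len(aligned_reference):
        raise ValueError("aligned sequences must have the same length")
    matched = sum(
        1 for q, r in zip(aligned_query, aligned_reference)
        if q == r and r != "-"
    )
    window_size = sum(1 for r in aligned_reference if r != "-")
    if window_size == 0:
        raise ValueError("reference sequence has no positions")
    return 100.0 * matched / window_size


# The query has one deletion relative to a ten-position reference window.
identity = percent_identity("ACGT-CGTAC", "ACGTACGTAC")
```

Here nine of the ten reference positions are matched, giving 90 percent identity. A real comparison would use a window of at least 20 positions, as stated above.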

Therefore, the present invention encompasses polynucleotide and 
polypeptide sequences having substantial identity to the sequences disclosed herein, for 
example those comprising at least 50% sequence identity, preferably at least 55%, 60%, 
65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or higher, sequence 
identity compared to a polynucleotide or polypeptide sequence of this invention using 
the methods described herein (e.g., BLAST analysis using standard parameters, as 
described below). One skilled in this art will recognize that these values can be 
appropriately adjusted to determine corresponding identity of proteins encoded by two 
nucleotide sequences by taking into account codon degeneracy, amino acid similarity, 
reading frame positioning and the like. 

In additional embodiments, the present invention provides isolated 
polynucleotides and polypeptides comprising various lengths of contiguous stretches of 
sequence identical to or complementary to one or more of the sequences disclosed 
herein. For example, polynucleotides are provided by this invention that comprise at 
least about 15, 20, 30, 40, 50, 75, 100, 150, 200, 300, 400, 500 or 1000 or more 
contiguous nucleotides of one or more of the sequences disclosed herein as well as all 
intermediate lengths there between. It will be readily understood that "intermediate 
lengths", in this context, means any length between the quoted values, such as 16, 17, 
18, 19, etc.; 21, 22, 23, etc.; 30, 31, 32, etc.; 50, 51, 52, 53, etc.; 100, 101, 102, 103, 
etc.; 150, 151, 152, 153, etc.; including all integers through 200-500; 500-1,000, and the 
like. 

The polynucleotides of the present invention, or fragments thereof, 
regardless of the length of the coding sequence itself, may be combined with other 
DNA sequences, such as promoters, polyadenylation signals, additional restriction 
enzyme sites, multiple cloning sites, other coding segments, and the like, such that their 
overall length may vary considerably. It is therefore contemplated that a nucleic acid 
fragment of almost any length may be employed, with the total length preferably being 
limited by the ease of preparation and use in the intended recombinant DNA protocol. 
For example, illustrative DNA segments with total lengths of about 10,000, about 5,000, 
about 3,000, about 2,000, about 1,000, about 500, about 200, about 100, about 50 base 
pairs in length, and the like, (including all intermediate lengths) are contemplated to be 
useful in many implementations of this invention. 

In other embodiments, the present invention is directed to 
polynucleotides that are capable of hybridizing under moderately stringent conditions to 
a polynucleotide sequence provided herein, or a fragment thereof, or a complementary 
sequence thereof. Hybridization techniques are well known in the art of molecular 
biology. For purposes of illustration, suitable moderately stringent conditions for 
testing the hybridization of a polynucleotide of this invention with other 
polynucleotides include prewashing in a solution of 5X SSC, 0.5% SDS, 1.0 mM 
EDTA (pH 8.0); hybridizing at 50°C-65°C, 5X SSC, overnight; followed by washing 
twice at 65°C for 20 minutes with each of 2X, 0.5X and 0.2X SSC containing 0.1% 
SDS. 

Moreover, it will be appreciated by those of ordinary skill in the art that, 
as a result of the degeneracy of the genetic code, there are many nucleotide sequences 
that encode a polypeptide as described herein. Some of these polynucleotides bear 
minimal homology to the nucleotide sequence of any native gene. Nonetheless, 
polynucleotides that vary due to differences in codon usage are specifically 
contemplated by the present invention. Further, alleles of the genes comprising the 
polynucleotide sequences provided herein are within the scope of the present invention. 
Alleles are endogenous genes that are altered as a result of one or more mutations, such 
as deletions, additions and/or substitutions of nucleotides. The resulting mRNA and 
protein may, but need not, have an altered structure or function. Alleles may be 
identified using standard techniques (such as hybridization, amplification and/or 
database sequence comparison). 

Microarray Analyses 

Polynucleotides that are suitable for detection according to the methods of 
the present invention may be identified, as described in more detail below, by screening 
a microarray of cDNAs for tissue and/or tumor-associated expression (e.g., expression 
that is at least two-fold greater in a tumor than in normal tissue, as determined using a 
representative assay provided herein). Such screens may be performed, for example, 
using a Synteni microarray (Palo Alto, CA) according to the manufacturer's instructions 
(and essentially as described by Schena et al., Proc. Natl. Acad. Sci. USA 93:10614- 
10619 (1996) and Heller et al., Proc. Natl. Acad. Sci. USA 94:2150-2155 (1997)). 



Microarray is an effective method for evaluating large numbers of genes, 
but due to its limited sensitivity it may not accurately determine the absolute tissue 
distribution of low abundance genes or may underestimate the degree of overexpression 
of more abundant genes due to signal saturation. For those genes showing 
overexpression by microarray expression profiling, further analysis was performed 
using quantitative RT-PCR based on Taqman™ probe detection, which comprises a 
greater dynamic range of sensitivity. Several different panels of normal and tumor 
tissues, distant metastases and cell lines were used for this purpose. 

Quantitative Real-time Polymerase Chain Reaction 
Suitable polynucleotides according to the present invention may be 
further characterized or, alternatively, originally identified by employing a quantitative 
PCR methodology such as, for example, the Real-time PCR methodology. By this 
methodology, tissue and/or tumor samples, such as, e.g., metastatic tumor samples, may 
be tested alongside the corresponding normal tissue sample and/or a panel of unrelated 
normal tissue samples. 

Real-time PCR (see Gibson et al., Genome Research 6:995-1001, 1996; 
Heid et al., Genome Research 6:986-994, 1996) is a technique that evaluates the level 
of PCR product accumulation during amplification. This technique permits quantitative 
evaluation of mRNA levels in multiple samples. Briefly, mRNA is extracted from 
tumor and normal tissue and cDNA is prepared using standard techniques. 

Real-time PCR may, for example, be performed either on the ABI 7700 
Prism or on a GeneAmp® 5700 sequence detection system (PE Biosystems, Foster 
City, CA). The 7700 system uses a forward and a reverse primer in combination with a 
specific probe with a 5' fluorescent reporter dye at one end and a 3' quencher dye at the 
other end (Taqman™). When the Real-time PCR is performed using Taq DNA 
polymerase with 5'-3' nuclease activity, the probe is cleaved and begins to fluoresce, 
allowing the reaction to be monitored by the increase in fluorescence (Real-time). The 
5700 system uses SYBR® green, a fluorescent dye that only binds to double stranded 
DNA, and the same forward and reverse primers as the 7700 instrument. Matching 
primers and fluorescent probes may be designed according to the primer express 
program (PE Biosystems, Foster City, CA). Optimal concentrations of primers and 
probes are initially determined by those of ordinary skill in the art. Control (e.g., 
β-actin) primers and probes may be obtained commercially from, for example, Perkin 
Elmer/Applied Biosystems (Foster City, CA). 

To quantitate the amount of specific RNA in a sample, a standard curve is 
generated using a plasmid containing the gene of interest. Standard curves are 
generated using the Ct values determined in the real-time PCR, which are related to the 
initial cDNA concentration used in the assay. Standard dilutions ranging from 10-10^ 
copies of the gene of interest are generally sufficient. In addition, a standard curve is 
generated for the control sequence. This permits standardization of initial RNA content 
of a tissue sample to the amount of control for comparison purposes. 
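
One common way to use such a standard curve, shown here only as a sketch, is to fit a straight line of Ct against log10 of the plasmid copy number and then invert the line for an unknown sample. The linear model, the function names, the dilution series and all Ct values below are assumptions made for illustration and are not data from this disclosure.

```python
import math


def fit_standard_curve(copies, ct_values):
    """Least-squares fit of Ct = slope * log10(copies) + intercept."""
    xs = [math.log10(c) for c in copies]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ct_values) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("need at least two distinct dilutions")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ct_values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def copies_from_ct(ct, slope, intercept):
    """Invert the standard curve to estimate the starting copy number."""
    return 10 ** ((ct - intercept) / slope)


def normalize_to_control(target_copies, control_copies):
    """Express the gene of interest relative to the control sequence."""
    return target_copies / control_copies


# Hypothetical plasmid dilution series and the Ct values measured for it.
slope, intercept = fit_standard_curve([1e1, 1e2, 1e3, 1e4], [35.0, 31.7, 28.4, 25.1])
sample_copies = copies_from_ct(28.4, slope, intercept)
relative_level = normalize_to_control(sample_copies, 1e4)
```

In practice a separate curve would be fitted for the control sequence (e.g., β-actin), and the control copy number passed to `normalize_to_control` would come from that second curve rather than being a fixed value.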

In accordance with the above, and as described further below, the present 
invention provides the illustrative breast tissue- and/or tumor-specific polynucleotides 
mammaglobin, lipophilin B, GABAπ (B899P), B726P, B511S, B533S, B305D and 
B311D having sequences set forth in SEQ ID NO: 1, 3, 5-7, 11, 13, 15, 17, 19-24, 30, 
32, and 73-76, and illustrative polypeptides encoded thereby having amino acid sequences 
set forth in SEQ ID NO: 2, 4, 8-10, 12, 14, 16, 18, 25-29, 31 and 77, that may be 
suitably employed in the detection of cancer, more specifically, breast cancer. 

The methods disclosed herein will also permit the identification of 
additional and/or alternative polynucleotides that are suitable for the detection of a wide 
range of cancers including, but not limited to, prostate cancer, breast cancer, colon 
cancer, ovarian cancer, lung cancer, head & neck cancer, lymphoma, leukemia, 
melanoma, liver cancer, gastric cancer, kidney cancer, bladder cancer, pancreatic cancer 
and endometrial cancer. 

Methodologies for the Detection of Cancer 

In general, a cancer cell may be detected in a patient based on the 
presence of one or more polynucleotides within cells of a biological sample (for 
example, blood, lymph nodes, bone marrow, sera, sputum, urine and/or tumor biopsies) 
obtained from the patient. In other words, such polynucleotides may be used as 
markers to indicate the presence or absence of a cancer such as, e.g., breast cancer. 

As discussed in further detail herein, the present invention achieves these 
and other related objectives by providing a methodology for the simultaneous detection 
of more than one polynucleotide, the presence of which is diagnostic of the presence of 
cancer cells in a biological sample. Each of the various cancer detection methodologies 
disclosed herein has in common a step of hybridizing one or more oligonucleotide 
primers and/or probes, the hybridization of which is demonstrative of the presence of a 
tumor- and/or tissue-specific polynucleotide. Depending on the precise application 
contemplated, it may be preferred to employ one or more intron-spanning 
oligonucleotides that are inoperative against polynucleotide of genomic DNA and, thus, 
these oligonucleotides are effective in substantially reducing and/or eliminating the 
detection of genomic DNA in the biological sample. 
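
The reason an intron-spanning oligonucleotide does not operate on genomic DNA can be illustrated with a toy example. The sequences below are invented, and exact substring matching is used only as a crude stand-in for hybridization; the function name is likewise hypothetical.

```python
def is_cdna_specific(oligo, exon1, intron, exon2):
    """True if the oligo matches the spliced cDNA but not the genomic locus.

    The spliced cDNA joins exon1 directly to exon2, whereas the genomic
    sequence still carries the intron between them.
    """
    cdna = exon1 + exon2
    genomic = exon1 + intron + exon2
    return oligo in cdna and oligo not in genomic


# Invented toy sequences (not from the sequence listing).
EXON1 = "ATGGCCAAGT"
INTRON = "GTAAGTTTTTCAG"
EXON2 = "CCTGAAGGTC"

# An oligo covering the last 5 bases of exon 1 and the first 5 of exon 2.
junction_oligo = EXON1[-5:] + EXON2[:5]
junction_ok = is_cdna_specific(junction_oligo, EXON1, INTRON, EXON2)

# An oligo lying entirely within exon 1 matches genomic DNA as well.
exonic_ok = is_cdna_specific(EXON1, EXON1, INTRON, EXON2)
```

The junction oligo is contiguous only in the spliced sequence, so it is flagged as cDNA-specific, while the purely exonic oligo is not.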

Further disclosed herein are methods for enhancing the sensitivity of 
these detection methodologies by subjecting the biological samples to be tested to one 
or more cell capture and/or cell depletion methodologies. 

By certain embodiments of the present invention, the presence of a 
cancer cell in a patient may be determined by employing the following steps: (a) 
obtaining a biological sample from said patient; (b) contacting said biological sample 
with a first oligonucleotide that hybridizes to a first polynucleotide, said first 
polynucleotide selected from the group consisting of polynucleotides depicted in SEQ 
ID NO:73 and SEQ ID NO:74; (c) contacting said biological sample with a second 
oligonucleotide that hybridizes to a second polynucleotide selected from the group 
consisting of SEQ ID NO: 1, 3, 5-7, 11, 13, 15, 17, 19-24, 30, 32, and 75; (d) 
detecting in said sample an amount of a polynucleotide that hybridizes to at least 
one of the oligonucleotides; and (e) comparing the amount of the polynucleotide that 
hybridizes to said oligonucleotide to a predetermined cut-off value, and therefrom 
determining the presence or absence of a cancer in the patient. 

Alternative embodiments of the present invention provide methods 
wherein the presence of a cancer cell in a patient is determined by employing the steps 
of: (a) obtaining a biological sample from said patient; (b) contacting said biological 
sample with a first oligonucleotide that hybridizes to a first polynucleotide, said first 
polynucleotide depicted in SEQ ID NO:76; (c) contacting said biological sample with a 
second oligonucleotide that hybridizes to a second polynucleotide selected from the 
group consisting of SEQ ID NO: 1, 3, 5-7, 11, 13, 15, 17, 19-24, 30, 32, and 75; (d) 
detecting in said sample an amount of a polynucleotide that hybridizes to at least one of 
the oligonucleotides; and (e) comparing the amount of the polynucleotide that 
hybridizes to said oligonucleotide to a predetermined cut-off value, and therefrom 
determining the presence or absence of a cancer in the patient. 

Other embodiments of the present invention provide methods for 
determining the presence or absence of a cancer in a patient. Such methods comprise 
the steps of: (a) obtaining a biological sample from said patient; (b) contacting said 
biological sample obtained from a patient with a first oligonucleotide that hybridizes to 
a polynucleotide sequence selected from the group consisting of polynucleotides 
depicted in SEQ ID NO:73, SEQ ID NO:74 and SEQ ID NO:76; (c) contacting said 
biological sample with a second oligonucleotide that hybridizes to a polynucleotide as 
depicted in SEQ ID NO:75; (d) contacting said biological sample with a third 
oligonucleotide that hybridizes to a polynucleotide selected from the group consisting 
of polynucleotides depicted in SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7; (e) 
contacting said biological sample with a fourth oligonucleotide that hybridizes 
to a polynucleotide selected from the group consisting of polynucleotides depicted in 
SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:20, SEQ 
ID NO:21, SEQ ID NO:22, SEQ ID NO:23 and SEQ ID NO:24; (f) detecting in said 
biological sample an amount of a polynucleotide that hybridizes to at least one of said 
oligonucleotides; and (g) comparing the amount of polynucleotide that hybridizes to the 
oligonucleotide to a predetermined cut-off value, and therefrom determining the 
presence or absence of a cancer in the patient. 
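
The final comparison step of these methods can be sketched as below. The decision rule (a sample is called positive when the amount detected for at least one marker exceeds that marker's cut-off), the marker labels and all numeric values are assumptions made for this illustration; the text above does not fix how the cut-off values are chosen.

```python
def compare_to_cutoffs(amounts, cutoffs):
    """Compare detected amounts to predetermined cut-off values.

    amounts maps a marker label to the amount of hybridizing polynucleotide
    detected in the sample; cutoffs maps the same labels to cut-off values.
    Returns (detected, positive_markers).
    """
    positive_markers = sorted(
        marker for marker, amount in amounts.items()
        if amount > cutoffs[marker]
    )
    return bool(positive_markers), positive_markers


# Hypothetical amounts for a four-marker assay, in arbitrary units.
cutoffs = {"mammaglobin": 50.0, "GABApi": 20.0, "B726P": 10.0, "B305D": 10.0}
sample = {"mammaglobin": 120.0, "GABApi": 4.0, "B726P": 2.5, "B305D": 3.0}
detected, positives = compare_to_cutoffs(sample, cutoffs)
```

Because a positive call on any single marker suffices under this rule, adding markers can only raise sensitivity for a fixed set of cut-offs, at the possible cost of specificity.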

To permit hybridization under assay conditions, oligonucleotide primers 
and probes should comprise an oligonucleotide sequence that has at least about 60%, 
preferably at least about 75% and more preferably at least about 90%, identity to a 
portion of a polynucleotide encoding a breast tumor protein that is at least 10 
nucleotides, and preferably at least 20 nucleotides, in length. Preferably, 
oligonucleotide primers hybridize to a polynucleotide encoding a polypeptide described 
herein under moderately stringent conditions, as defined above. Oligonucleotide 
primers which may be usefully employed in the diagnostic methods described herein 
preferably are at least 10-40 nucleotides in length. In a preferred embodiment, the 
oligonucleotide primers comprise at least 10 contiguous nucleotides, more preferably at 
least 15 contiguous nucleotides, of a DNA molecule having a sequence recited in SEQ 
ID NO: 1, 3, 5-7, 11, 13, 15, 17, 19-24, 30, 32 and 73-76. Techniques for both PCR 
based assays and hybridization assays are well known in the art (see, for example, 
Mullis et al., Cold Spring Harbor Symp. Quant. Biol., 51:263, 1987; Erlich ed., PCR 
Technology, Stockton Press, NY, 1989). 

The present invention also provides amplification-based methods for 
detecting the presence of a cancer cell in a patient. Exemplary methods comprise the 
steps of (a) obtaining a biological sample from a patient; (b) contacting the biological 
sample with a first oligonucleotide pair, the first pair comprising a first oligonucleotide 
and a second oligonucleotide wherein the first oligonucleotide and the second 
oligonucleotide hybridize to a first polynucleotide and the complement thereof, 
respectively; (c) contacting the biological sample with a second oligonucleotide pair, the 
second pair comprising a third oligonucleotide and a fourth oligonucleotide wherein the 
third and the fourth oligonucleotide hybridize to a second polynucleotide and the 
complement thereof, respectively, and wherein the first polynucleotide is unrelated in 
nucleotide sequence to the second polynucleotide; (d) amplifying the first 
polynucleotide and the second polynucleotide; and (e) detecting the amplified first 
polynucleotide and the amplified second polynucleotide; wherein the presence of the 
amplified first polynucleotide or the amplified second polynucleotide indicates the 
presence of a cancer cell in the patient. 

Methods according to the present invention are suitable for identifying 
polynucleotides obtained from a wide variety of biological samples such as, for example, 
blood, serum, lymph node, bone marrow, sputum, urine and tumor biopsy samples. In 
certain preferred embodiments, the biological sample is either blood, a lymph node or 
bone marrow. In other embodiments of the present invention, the lymph node may be a 
sentinel lymph node. 

It will be apparent that the present methods may be employed in the 
detection of a wide variety of cancers. Exemplary cancers include, but are not limited 
to, prostate cancer, breast cancer, colon cancer, ovarian cancer, lung cancer, head & 
neck cancer, lymphoma, leukemia, melanoma, liver cancer, gastric cancer, kidney 
cancer, bladder cancer, pancreatic cancer and endometrial cancer. 

Certain exemplary embodiments of the present invention provide 
methods wherein the polynucleotides to be detected are selected from the group 
consisting of mammaglobin, lipophilin B, GABAπ (B899P), B726P, B511S, B533S, 
B305D and B311D. Alternatively and/or additionally, polynucleotides to be detected 
may be selected from the group consisting of those depicted in SEQ ID NO:73, SEQ ID 
NO:74, SEQ ID NO:75, SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:6, 
SEQ ID NO:7, SEQ ID NO:11, SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ 
ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID 
NO:24, SEQ ID NO:30, SEQ ID NO:32, and SEQ ID NO:76. 

Suitable exemplary oligonucleotide probes and/or primers that may be 
used according to the methods of the present invention are disclosed herein by SEQ ID 
NOs:33-35 and 63-72. In certain preferred embodiments that eliminate the background 
detection of genomic DNA, the oligonucleotides may be intron spanning 
oligonucleotides. Exemplary intron spanning oligonucleotides suitable for the 
detection of various polynucleotides disclosed herein are depicted in SEQ ID NOs:36- 
62. 

Depending on the precise application contemplated, the artisan may prefer to 
detect the tissue- and/or tumor-specific polynucleotides by detecting a radiolabel and 
detecting a fluorophore. More specifically, the oligonucleotide probe and/or primer 
may comprise a detectable moiety such as, for example, a radiolabel and/or a 
fluorophore. 

Alternatively or additionally, methods of the present invention may also 
comprise a step of fractionation prior to detection of the tissue- and/or tumor-specific 
polynucleotides such as, for example, by gel electrophoresis. 

In other embodiments, methods described herein may be used to 
monitor the progression of cancer. By these embodiments, assays as provided for the 
diagnosis of a cancer may be performed over time, and the change in the level of 
reactive polypeptide(s) or polynucleotide(s) evaluated. For example, the assays may be 
performed every 24-72 hours for a period of 6 months to 1 year, and thereafter 
performed as needed. In general, a cancer is progressing in those patients in whom the 
level of polypeptide or polynucleotide detected increases over time. In contrast, the 
cancer is not progressing when the level of reactive polypeptide or polynucleotide either 
remains constant or decreases with time. 
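
A deliberately simple sketch of this rule follows. Comparing only the first and last measurements, and the optional tolerance for assay noise, are assumptions made for the example; the text above does not prescribe how an increase over time is to be established, and the levels shown are invented.

```python
def classify_progression(levels, tolerance=0.0):
    """Classify a time-ordered series of detected marker levels.

    Returns "progressing" when the last level exceeds the first by more
    than the tolerance, and "not progressing" when the level has stayed
    constant or decreased.
    """
    if len(levels) < 2:
        raise ValueError("at least two time points are required")
    change = levels[-1] - levels[0]
    return "progressing" if change > tolerance else "not progressing"


# Hypothetical normalized marker levels from successive assays.
rising = classify_progression([1.0, 1.5, 2.4])
stable = classify_progression([2.0, 2.0, 2.0])
falling = classify_progression([3.0, 2.0])
```

A trend fitted over all time points would be more robust to a single outlying measurement than this first-versus-last comparison.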

Certain in vivo diagnostic assays may be performed directly on a tumor. 
One such assay involves contacting tumor cells with a binding agent. The bound 
binding agent may then be detected directly or indirectly via a reporter group. Such 
binding agents may also be used in histological applications. Alternatively, 
polynucleotide probes may be used within such applications. 

As noted above, to improve sensitivity, multiple breast tumor protein 
markers may be assayed within a given sample. It will be apparent that binding agents 
specific for different proteins provided herein may be combined within a single assay. 
Further, multiple primers or probes may be used concurrently. The selection of tumor 
protein markers may be based on routine experiments to determine combinations that 
result in optimal sensitivity. In addition, or alternatively, assays for tumor proteins 
provided herein may be combined with assays for other known tumor antigens. 

Cell Enrichment 

In other aspects of the present invention, cell capture technologies may 
be used prior to polynucleotide detection to improve the sensitivity of the various 
detection methodologies disclosed herein. 

Exemplary cell enrichment methodologies employ immunomagnetic 
beads that are coated with specific monoclonal antibodies to surface cell markers, or 
tetrameric antibody complexes, which may be used to first enrich or positively select 
cancer cells in a sample. Various commercially available kits may be used, including 
Dynabeads® Epithelial Enrich (Dynal Biotech, Oslo, Norway), StemSep™ (StemCell 
Technologies, Inc., Vancouver, BC), and RosetteSep (StemCell Technologies). The 
skilled artisan will recognize that other readily available methodologies and kits may 
also be suitably employed to enrich or positively select desired cell populations. 

Dynabeads® Epithelial Enrich contains magnetic beads coated with 
mAbs specific for two glycoprotein membrane antigens expressed on normal and 
neoplastic epithelial tissues. The coated beads may be added to a sample and the 
sample then applied to a magnet, thereby capturing the cells bound to the beads. The 
unwanted cells are washed away and the magnetically isolated cells eluted from the 
beads and used in further analyses. 

RosetteSep can be used to enrich cells directly from a blood sample and 
consists of a cocktail of tetrameric antibodies that targets a variety of unwanted cells and 
crosslinks them to glycophorin A on red blood cells (RBC) present in the sample, 
forming rosettes. When centrifuged over Ficoll, targeted cells pellet along with the free 
RBC. 

The combination of antibodies in the depletion cocktail determines
which cells will be removed and consequently which cells will be recovered.
Antibodies that are available include, but are not limited to: CD2, CD3, CD4, CD5,
CD8, CD10, CD11b, CD14, CD15, CD16, CD19, CD20, CD24, CD25, CD29, CD33,
CD34, CD36, CD38, CD41, CD45, CD45RA, CD45RO, CD56, CD66B, CD66e, HLA-DR,
IgE, and TCRαβ. Additionally, it is contemplated in the present invention that
mAbs specific for breast tumor antigens can be developed and used in a similar
manner. For example, mAbs that bind to tumor-specific cell surface antigens may be
conjugated to magnetic beads, or formulated in a tetrameric antibody complex, and used
to enrich or positively select metastatic breast tumor cells from a sample.
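
The relationship between the depletion cocktail and the recovered cells can be sketched
as a simple set operation. This is a simplified illustration and not a description of
any kit: the cell names, the marker assignments and the "tumor_antigen" placeholder are
assumptions made for the example, and a cell is treated as depleted if it carries any
marker targeted by the cocktail.

```python
def recovered_cells(cells, depletion_cocktail):
    """Return names of cells carrying none of the markers in the cocktail.

    cells: dict mapping a cell name to the set of surface markers it carries.
    depletion_cocktail: iterable of markers targeted for removal.
    """
    targeted = set(depletion_cocktail)
    return [name for name, markers in cells.items() if not targeted & set(markers)]


# Hypothetical, simplified marker assignments (illustrative only).
example_cells = {
    "T cell": {"CD3", "CD45"},
    "B cell": {"CD19", "CD45"},
    "monocyte": {"CD14", "CD45"},
    "tumor cell": {"tumor_antigen"},
}
```

In this sketch a cocktail containing only CD45 leaves the hypothetical tumor cell, while
a cocktail containing only CD3 removes the T cell and leaves everything else.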

Once a sample is enriched or positively selected, cells may be further
analysed. For example, the cells may be lysed and RNA isolated. RNA may then be
subjected to RT-PCR analysis using breast tumor-specific multiplex primers in a
Real-time PCR assay as described herein.

In another aspect of the present invention, cell capture technologies may
be used in conjunction with Real-time PCR to provide a more sensitive tool for
detection of metastatic cells expressing breast tumor antigens. Detection of breast
cancer cells in bone marrow samples, peripheral blood, and small needle aspiration
samples is desirable for diagnosis and prognosis in breast cancer patients.

Probes and Primers 

As noted above and as described in further detail herein, certain 
methods, compositions and kits according to the present invention utilize two or more
oligonucleotide primer pairs for the detection of cancer. The ability of such nucleic 
acid probes to specifically hybridize to a sequence of interest will enable them to be of 
use in detecting the presence of complementary sequences in a biological sample. 

Alternatively, in other embodiments, the probes and/or primers of the
present invention may be employed for detection via nucleic acid hybridization. As
such, it is contemplated that nucleic acid segments that comprise a sequence region of
at least about 15 contiguous nucleotides that has the same sequence as, or
is complementary to, a 15 nucleotide long contiguous sequence of a polynucleotide to
be detected will find particular utility. Longer contiguous identical or complementary
sequences, e.g., those of about 20, 30, 40, 50, 100, 200, 500, 1000 (including all
intermediate lengths) and even up to full length sequences will also be of use in certain
embodiments.

Oligonucleotide primers having sequence regions consisting of
contiguous nucleotide stretches of 10-14, 15-20, 30, 50, or even of 100-200 nucleotides
or so (including intermediate lengths as well), identical or complementary to a
polynucleotide to be detected, are particularly contemplated as primers for use in
amplification reactions such as, e.g., the polymerase chain reaction (PCR™). This
would allow a polynucleotide to be analyzed in diverse biological samples such
as, for example, blood, lymph nodes and bone marrow.

The use of a primer of about 15-25 nucleotides in length allows the
formation of a duplex molecule that is both stable and selective. Molecules having
contiguous complementary sequences over stretches greater than 15 bases in length are
generally preferred, though, in order to increase stability and selectivity of the hybrid,
and thereby improve the quality and degree of specific hybrid molecules obtained. One
will generally prefer to design primers having gene-complementary stretches of 15 to
25 contiguous nucleotides, or even longer where desired.
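
As an illustration of the preferred primer lengths, the following sketch enumerates
every contiguous stretch of 15 to 25 nucleotides in a template and provides the reverse
complement needed for a primer that anneals to the opposite strand. The function names
are hypothetical, and the sketch makes no attempt to rank candidates by stability or
selectivity.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def candidate_primers(template, min_len=15, max_len=25):
    """Yield (start, length, sequence) for each contiguous window of allowed length."""
    for length in range(min_len, max_len + 1):
        for start in range(len(template) - length + 1):
            yield start, length, template[start:start + length]
```

A forward primer may be taken directly from such a window near one terminus of the
sequence, and a reverse primer as the reverse complement of a window near the other.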

Primers may be selected from any portion of the polynucleotide to be
detected. All that is required is to review the sequence, such as those exemplary
polynucleotides set forth in SEQ ID NO: 1, 3, 5-7, 11, 13, 15, 17, 19-24, 30, 32, 73-75
(Figures 3-6, respectively) and SEQ ID NO:76 (lipophilin B), or any continuous
portion of the sequence, from about 15-25 nucleotides in length up to and including the
full length sequence, that one wishes to utilize as a primer. The choice of primer
sequences may be governed by various factors. For example, one may wish to employ
primers from towards the termini of the total sequence. The exemplary primers
disclosed herein may optionally be used for their ability to selectively form duplex
molecules with complementary stretches of the entire polynucleotide of interest such as
those set forth in SEQ ID NO: 1, 3, 5-7, 11, 13, 15, 17, 19-24, 30, 32, 73-75
(Figures 3-6, respectively), and SEQ ID NO:76 (lipophilin B).

The present invention further provides the nucleotide sequences of
various exemplary oligonucleotide primers and probes, set forth in SEQ ID NOs: 33-71,
that may be used, as described in further detail herein, according to the methods of the
present invention for the detection of cancer.

Oligonucleotide primers according to the present invention may be
readily prepared by methods commonly available to the skilled artisan
including, for example, directly synthesizing the primers by chemical means, as is
commonly practiced using an automated oligonucleotide synthesizer. Depending on the
application envisioned, one will typically desire to employ varying conditions of
hybridization to achieve varying degrees of selectivity of probe towards target
sequence. For applications requiring high selectivity, one will typically desire to
employ relatively stringent conditions to form the hybrids, e.g., one will select
relatively low salt and/or high temperature conditions, such as provided by a salt
concentration of from about 0.02 M to about 0.15 M salt at temperatures of from about
50°C to about 70°C. Such selective conditions tolerate little, if any, mismatch between
the probe and the template or target strand, and would be particularly suitable for
isolating related sequences.
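
The interplay of salt concentration and temperature described above can be made concrete
with an estimate of duplex melting temperature. The sketch below uses one commonly cited
empirical approximation, Tm = 81.5 + 16.6 log10[Na+] + 0.41 (%GC) - 675/N. That formula
is an assumption brought in for illustration and is not part of this disclosure; its
constants vary between references and it gives only a rough guide. The function names
and the example sequence are hypothetical.

```python
import math


def gc_percent(seq):
    """Percentage of G and C bases in a sequence."""
    if not seq:
        raise ValueError("empty sequence")
    upper = seq.upper()
    return 100.0 * (upper.count("G") + upper.count("C")) / len(upper)


def estimate_tm(seq, na_molar):
    """Rough melting temperature (deg C) from an empirical salt-adjusted formula.

    Assumed approximation, not taken from this disclosure:
    Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%GC) - 675/N
    """
    if na_molar <= 0:
        raise ValueError("salt concentration must be positive")
    return (81.5 + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent(seq) - 675.0 / len(seq))
```

Under this approximation, lowering the salt concentration lowers the estimated melting
temperature, which is consistent with low salt and high temperature both making the
hybridization conditions more stringent.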

Polynucleotide Amplification Techniques

Each of the specific embodiments outlined herein for the detection of
cancer has in common the detection of a tissue- and/or tumor-specific polynucleotide
via the hybridization of one or more oligonucleotide primers and/or probes. Depending
on such factors as the relative number of cancer cells present in the biological sample
and/or the level of polynucleotide expression within each cancer cell, it may be
preferred to perform an amplification step prior to performing the steps of detection.
For example, at least two oligonucleotide primers may be employed in a polymerase
chain reaction (PCR) based assay to amplify a portion of a breast tumor cDNA derived
from a biological sample, wherein at least one of the oligonucleotide primers is specific
for (i.e., hybridizes to) a polynucleotide encoding the breast tumor protein. The
amplified cDNA may optionally be subjected to a fractionation step such as, for
example, gel electrophoresis.

A number of template dependent processes are available to amplify the
target sequences of interest present in a sample. One of the best known amplification
methods is the polymerase chain reaction (PCR™) which is described in detail in U.S.
Patent Nos. 4,683,195, 4,683,202 and 4,800,159, each of which is incorporated herein
by reference in its entirety. Briefly, in PCR™, two primer sequences are prepared
which are complementary to regions on opposite complementary strands of the target
sequence. An excess of deoxynucleoside triphosphates is added to a reaction mixture
along with a DNA polymerase (e.g., Taq polymerase). If the target sequence is present
in a sample, the primers will bind to the target and the polymerase will cause the
primers to be extended along the target sequence by adding on nucleotides. By raising
and lowering the temperature of the reaction mixture, the extended primers will
dissociate from the target to form reaction products, excess primers will bind to the
target and to the reaction product and the process is repeated. Preferably, a reverse
transcription and PCR™ amplification procedure may be performed in order to quantify
the amount of mRNA amplified. Polymerase chain reaction methodologies are well
known in the art.
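
Because the products of each cycle serve as templates in the next, the temperature
cycling described above gives approximately exponential growth in target copies. A
minimal sketch of the idealized arithmetic follows. The function name and the
per-cycle efficiency parameter are assumptions made for illustration, and real
reactions plateau as reagents are consumed.

```python
def pcr_copies(initial_copies, cycles, efficiency=1.0):
    """Idealized copy number after a number of PCR cycles.

    efficiency is the assumed fraction of templates copied per cycle
    (1.0 corresponds to perfect doubling).
    """
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    return initial_copies * (1.0 + efficiency) ** cycles
```

For example, with perfect doubling, 10 starting copies would give 10 x 2^30 copies after
30 cycles under this idealization.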

One preferred methodology for polynucleotide amplification employs
RT-PCR, in which PCR is applied in conjunction with reverse transcription. Typically,
RNA is extracted from a biological sample, such as blood, serum, lymph node, bone
marrow, sputum, urine and tumor biopsy samples, and is reverse transcribed to produce
cDNA molecules. PCR amplification using at least one specific primer generates a
cDNA molecule, which may be separated and visualized using, for example, gel
electrophoresis. Amplification may be performed on biological samples taken from a
patient and from an individual who is not afflicted with a cancer. The amplification
reaction may be performed on several dilutions of cDNA spanning two orders of
magnitude. A two-fold or greater increase in expression in several dilutions of the test
patient sample as compared to the same dilutions of the non-cancerous sample is
typically considered positive.
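
The positivity criterion above can be sketched as a comparison of matched dilutions. In
the sketch below the function name, the signal values and the interpretation of
"several dilutions" as at least two are assumptions made for illustration.

```python
def is_positive(test_signal, control_signal, fold=2.0, min_dilutions=2):
    """Call a sample positive when enough matched dilutions show the fold increase.

    test_signal, control_signal: dicts mapping a dilution factor to a measured
    expression signal. min_dilutions=2 is an assumed reading of "several".
    """
    shared = sorted(set(test_signal) & set(control_signal))
    passing = [
        dilution for dilution in shared
        if control_signal[dilution] > 0
        and test_signal[dilution] / control_signal[dilution] >= fold
    ]
    return len(passing) >= min_dilutions


# Hypothetical signals at 1:1, 1:10 and 1:100 dilutions (illustrative only).
example_normal = {1: 200.0, 10: 30.0, 100: 7.0}
example_patient = {1: 800.0, 10: 90.0, 100: 8.0}
```

Here the hypothetical patient sample exceeds the two-fold threshold at two of the three
dilutions and would be called positive under the stated assumptions.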

Any of a variety of commercially available kits may be used to perform
the amplification step. One such amplification technique is inverse PCR (see Triglia et
al., Nucl. Acids Res. 16:8186, 1988), which uses restriction enzymes to generate a
fragment in the known region of the gene. The fragment is then circularized by
intramolecular ligation and used as a template for PCR with divergent primers derived
from the known region. Within an alternative approach, sequences adjacent to a partial
sequence may be retrieved by amplification with a primer to a linker sequence and a
primer specific to a known region. The amplified sequences are typically subjected to a
second round of amplification with the same linker primer and a second primer specific
to the known region. A variation on this procedure, which employs two primers that
initiate extension in opposite directions from the known sequence, is described in WO
96/38591. Another such technique is known as "rapid amplification of cDNA ends" or
RACE. This technique involves the use of an internal primer and an external primer,
which hybridizes to a polyA region or vector sequence, to identify sequences that are 5'
and 3' of a known sequence. Additional techniques include capture PCR (Lagerstrom et
al., PCR Methods Applic. 1:111-19, 1991) and walking PCR (Parker et al., Nucl. Acids
Res. 19:3055-60, 1991). Other methods employing amplification may also be
employed to obtain a full length cDNA sequence.

Another method for amplification is the ligase chain reaction (referred to
as LCR), disclosed in Eur. Pat. Appl. Publ. No. 320,308 (specifically incorporated
herein by reference in its entirety). In LCR, two complementary probe pairs are
prepared, and in the presence of the target sequence, each pair will bind to opposite
complementary strands of the target such that they abut. In the presence of a ligase, the
two probe pairs will link to form a single unit. By temperature cycling, as in PCR™,
bound ligated units dissociate from the target and then serve as "target sequences" for
ligation of excess probe pairs. U.S. Patent No. 4,883,750, incorporated herein by
reference in its entirety, describes an alternative method of amplification similar to LCR
for binding probe pairs to a target sequence.

Qbeta Replicase, described in PCT Intl. Pat. Appl. Publ. No.
PCT/US87/00880, incorporated herein by reference in its entirety, may also be used as
still another amplification method in the present invention. In this method, a replicative
sequence of RNA that has a region complementary to that of a target is added to a
sample in the presence of an RNA polymerase. The polymerase will copy the
replicative sequence that can then be detected.

An isothermal amplification method, in which restriction endonucleases
and ligases are used to achieve the amplification of target molecules that contain
nucleotide 5'-[α-thio]triphosphates in one strand of a restriction site (Walker et al.,
1992, incorporated herein by reference in its entirety), may also be useful in the
amplification of nucleic acids in the present invention.

Strand Displacement Amplification (SDA) is another method of carrying
out isothermal amplification of nucleic acids which involves multiple rounds of strand
displacement and synthesis, i.e., nick translation. A similar method, called Repair Chain
Reaction (RCR), is another method of amplification which may be useful in the present
invention and involves annealing several probes throughout a region targeted for
amplification, followed by a repair reaction in which only two of the four bases are
present. The other two bases can be added as biotinylated derivatives for easy
detection. A similar approach is used in SDA.

Sequences can also be detected using a cyclic probe reaction (CPR). In
CPR, a probe having 3' and 5' sequences of non-target DNA and an internal or
"middle" sequence of the target protein specific RNA is hybridized to DNA which is
present in a sample. Upon hybridization, the reaction is treated with RNaseH, and the
products of the probe are identified as distinctive products by generating a signal that is
released after digestion. The original template is annealed to another cycling probe and
the reaction is repeated. Thus, CPR involves amplifying a signal generated by
hybridization of a probe to a target gene specific expressed nucleic acid.

Still other amplification methods described in Great Britain Pat. Appl.
No. 2 202 328, and in PCT Intl. Pat. Appl. Publ. No. PCT/US89/01025, each of which
is incorporated herein by reference in its entirety, may be used in accordance with the
present invention. In the former application, "modified" primers are used in a PCR-like,
template and enzyme dependent synthesis. The primers may be modified by
labeling with a capture moiety (e.g., biotin) and/or a detector moiety (e.g., enzyme). In
the latter application, an excess of labeled probes is added to a sample. In the presence
of the target sequence, the probe binds and is cleaved catalytically. After cleavage, the
target sequence is released intact to be bound by excess probe. Cleavage of the labeled
probe signals the presence of the target sequence.

Other nucleic acid amplification procedures include transcription-based
amplification systems (TAS) (Kwoh et al., 1989; PCT Intl. Pat. Appl. Publ. No. WO
88/10315, incorporated herein by reference in its entirety), including nucleic acid
sequence based amplification (NASBA) and 3SR. In NASBA, the nucleic acids can be
prepared for amplification by standard phenol/chloroform extraction, heat denaturation
of a sample, treatment with lysis buffer and minispin columns for isolation of DNA and
RNA, or guanidinium chloride extraction of RNA. These amplification techniques
involve annealing a primer that has sequences specific to the target sequence.
Following polymerization, DNA/RNA hybrids are digested with RNase H while double
stranded DNA molecules are heat-denatured again. In either case the single stranded
DNA is made fully double stranded by addition of a second target-specific primer,
followed by polymerization. The double stranded DNA molecules are then multiply
transcribed by a polymerase such as T7 or SP6. In an isothermal cyclic reaction, the
RNAs are reverse transcribed into DNA, and transcribed once again with a polymerase
such as T7 or SP6. The resulting products, whether truncated or complete, indicate
target-specific sequences.

Eur. Pat. Appl. Publ. No. 329,822, incorporated herein by reference in its
entirety, discloses a nucleic acid amplification process involving cyclically synthesizing
single-stranded RNA ("ssRNA"), ssDNA, and double-stranded DNA ("dsDNA"), which
may be used in accordance with the present invention. The ssRNA is a first template
for a first primer oligonucleotide, which is elongated by reverse transcriptase
(RNA-dependent DNA polymerase). The RNA is then removed from the resulting
DNA:RNA duplex by the action of ribonuclease H (RNase H, an RNase specific for
RNA in a duplex with either DNA or RNA). The resultant ssDNA is a second template
for a second primer, which also includes the sequences of an RNA polymerase
promoter (exemplified by T7 RNA polymerase) 5' to its homology to its template. This
primer is then extended by DNA polymerase (exemplified by the large "Klenow"
fragment of E. coli DNA polymerase I), resulting in a double-stranded DNA
molecule, having a sequence identical to that of the original RNA between
the primers and having additionally, at one end, a promoter sequence. This promoter
sequence can be used by the appropriate RNA polymerase to make many RNA copies
of the DNA. These copies can then re-enter the cycle leading to very swift
amplification. With proper choice of enzymes, this amplification can be done
isothermally without addition of enzymes at each cycle. Because of the cyclical nature
of this process, the starting sequence can be chosen to be in the form of either DNA or
RNA.

PCT Intl. Pat. Appl. Publ. No. WO 89/06700, incorporated herein by
reference in its entirety, discloses a nucleic acid sequence amplification scheme based on
the hybridization of a promoter/primer sequence to a target single-stranded DNA
("ssDNA") followed by transcription of many RNA copies of the sequence. This
scheme is not cyclic; i.e., new templates are not produced from the resultant RNA
transcripts. Other amplification methods include "RACE" (Frohman, 1990), and
"one-sided PCR" (Ohara, 1989), which are well-known to those of skill in the art.

Compositions and Kits for the Detection of Cancer 

The present invention further provides kits for use within any of the
above diagnostic methods. Such kits typically comprise two or more components
necessary for performing a diagnostic assay. Components may be compounds,
reagents, containers and/or equipment. For example, one container within a kit may
contain a monoclonal antibody or fragment thereof that specifically binds to a breast
tumor protein. Such antibodies or fragments may be provided attached to a support
material, as described above. One or more additional containers may enclose elements,
such as reagents or buffers, to be used in the assay. Such kits may also, or alternatively,
contain a detection reagent as described above that contains a reporter group suitable
for direct or indirect detection of antibody binding.

The present invention also provides kits that are suitable for performing
the detection methods of the present invention. Exemplary kits comprise
oligonucleotide primer pairs, each of which specifically hybridizes to a distinct
polynucleotide. Within certain embodiments, kits according to the present invention
may also comprise a nucleic acid polymerase and suitable buffer. Exemplary
oligonucleotide primers suitable for kits of the present invention are disclosed herein by
SEQ ID NOs: 33-71. Exemplary polynucleotides suitable for kits of the present
invention are disclosed in SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID
NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:11,
SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:20, SEQ
ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:30, SEQ ID
NO:32, and lipophilin B.

Alternatively, a kit may be designed to detect the level of mRNA
encoding a breast tumor protein in a biological sample. Such kits generally comprise at
least one oligonucleotide probe or primer, as described above, that hybridizes to a
polynucleotide encoding a breast tumor protein. Such an oligonucleotide may be used,
for example, within a PCR or hybridization assay. Additional components that may be
present within such kits include a second oligonucleotide and/or a diagnostic reagent or
container to facilitate the detection of a polynucleotide encoding a breast tumor protein.

In other related aspects, the present invention further provides
compositions useful in the methods disclosed herein. Exemplary compositions
comprise two or more oligonucleotide primer pairs, each of which specifically
hybridizes to a distinct polynucleotide. Exemplary oligonucleotide primers suitable for
compositions of the present invention are disclosed herein by SEQ ID NOs: 33-71.
Exemplary polynucleotides suitable for compositions of the present invention are
disclosed in SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:1, SEQ ID
NO:3, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:11, SEQ ID NO:13,
SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ
ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:30, SEQ ID NO:32, and
lipophilin B.

The following Examples are offered by way of illustration and not by
way of limitation.

EXAMPLES

EXAMPLE 1 
Differential Display 

This example discloses the use of differential display to enrich for
polynucleotides that are over-expressed in breast tumor tissues.

Differential display was performed as described in the literature (see,
e.g., Liang, P. et al., Science 257:967-971 (1993), incorporated herein by reference in
its entirety) with the following modifications: (a) PCR amplification products were
visualized on silver stained gels; (b) genetically matched pairs of tissues were used to
eliminate polymorphic variation; and (c) two different dilutions of cDNA were used as
template to eliminate any dilutional effects (see Mou, E. et al., Biochem Biophy Res
Commun. 199:564-569 (1994), incorporated herein by reference in its entirety).

EXAMPLE 2

Preparation of cDNA Subtraction Library 
This example discloses the preparation of a breast tumor cDNA 
subtraction library enriched in breast tumor specific polynucleotides. 

cDNA library subtraction was performed as described with some
modification. See, Hara, T. et al., Blood 84:189-199 (1994), incorporated herein by
reference in its entirety. The breast tumor library (tracer) that was made from a pool of
three breast tumors was subtracted with normal breast library (driver) to identify breast
tumor specific genes. More recent subtractions utilized 6-10 normal tissues as driver to
subtract out common genes more efficiently, with an emphasis on essential tissues
along with one "immunological" tissue (e.g., spleen, lymph node, or PBMC), to assist
in the removal of cDNAs derived from lymphocyte infiltration in tumors. The breast
tumor specific subtracted cDNA library was generated as follows: driver cDNA library
was digested with EcoRI, NotI, and SfuI (SfuI cleaves the vector), and filled in with DNA
polymerase Klenow fragment. After phenol-chloroform extraction and ethanol
precipitation, the DNA was labeled with Photoprobe biotin and dissolved in H2O.
Tracer cDNA library was digested with BamHI and XhoI, phenol chloroform extracted,
passed through Chroma spin-400 columns, ethanol precipitated, and mixed with driver
DNA for hybridization at 68°C for 20 hours [long hybridization (LH)]. The reaction
mixture was then subjected to the streptavidin treatment followed by phenol/chloroform
extraction for a total of four times. Subtracted DNA was precipitated and subjected to a
hybridization at 68°C for 2 hours with driver DNA again [short hybridization (SH)].
After removal of biotinylated double-stranded DNA, subtracted cDNA was ligated into
the BamHI/XhoI site of chloramphenicol resistant pBCSK+ and transformed into
ElectroMax E. coli DH10B cells by electroporation to generate the subtracted cDNA
library. To clone less abundant breast tumor specific genes, cDNA library subtraction
was repeated by subtracting the tracer cDNA library with the driver cDNA library plus
abundant cDNAs from primary subtractions. This resulted in the depletion of these
abundant sequences and the generation of subtraction libraries that contain less
abundant sequences.

To analyze the subtracted cDNA library, plasmid DNA was prepared
from 100-200 independent clones, which were randomly picked from the subtracted
library, and characterized by DNA sequencing. The determined cDNA and expected amino
acid sequences for the isolated cDNAs were compared to known sequences using the
most recent Genbank and human EST databases.

EXAMPLE 3

PCR-SUBTRACTION

This example discloses PCR subtraction to enrich for breast tumor
specific polynucleotides.

PCR-subtraction was performed essentially as described in the literature.
See, Diatchenko, L. et al., Proc Natl Acad Sci USA. 93:6025-6030 (1996) and Yang,
G.P. et al., Nucleic acids Res. 27:1517-23 (1999), incorporated herein by reference in
their entirety. Briefly, this type of subtraction works by ligating two different adapters
to different aliquots of a restriction enzyme digested tester (breast tumor) cDNA
sample, followed by mixing of the testers separately with excess driver (without
adapters). This first hybridization results in normalization of single stranded tester
specific cDNA due to the second order kinetics of hybridization. These separate
hybridization reactions are then mixed without denaturation, and a second hybridization
is performed which produces the target molecules: double stranded cDNA fragments
containing both of the different adapters. Two rounds of PCR are then performed, which
result in the exponential amplification of the target population molecules (normalized
tester specific cDNAs), while other fragments are either unamplified or only
amplified in a linear manner. The subtractions performed included a pool of breast
tumors subtracted with a pool of normal breast and a pool of breast tumors subtracted
with a pool of normal tissues including PBMC, brain, pancreas, liver, small intestine,
stomach, heart and kidney.

Prior to cDNA synthesis, RNA was treated with DNase I (Ambion)
in the presence of RNasin (Promega Biotech, Madison, WI) to remove DNA
contamination. The cDNA for use in real-time PCR tissue panels was prepared using
25 µl Oligo dT (Boehringer-Mannheim) primer with SuperScript II reverse transcriptase
(Gibco BRL, Bethesda, MD).

EXAMPLE 4 

Detection Of Breast Cancer Using Breast-Specific Antigens

The isolation and characterization of the breast-specific antigens B511S
and B533S is described in U.S. Patent Application 09/346,327, filed July 2, 1999, the
disclosure of which is hereby incorporated by reference in its entirety. The determined
cDNA sequence for B511S is provided in SEQ ID NO: 30, with the corresponding
amino acid sequence being provided in SEQ ID NO: 31. The determined cDNA
sequence for B533S is provided in SEQ ID NO: 32. The isolation and characterization
of the breast-specific antigen B726P is described in U.S. Patent Applications
09/285,480, filed April 2, 1999, and 09/433,826, filed November 3, 1999, the
disclosures of which are hereby incorporated by reference in their entirety.

The determined cDNA sequences for splice variants of B726P are
provided in SEQ ID NO: 13, 15, 17 and 19-24, with the corresponding amino acid
sequences being provided in SEQ ID NO: 14, 16, 18 and 25-29.

The isolation and characterization of the breast-specific antigen B305D
forms A and C has been described in U.S. Patent Application 09/429,755, filed October
28, 1999, the disclosure of which is hereby incorporated by reference in its entirety.
Determined cDNA sequences for B305D isoforms A and C are provided in SEQ ID
NO: 1, 3 and 5-7, with the corresponding amino acid sequences being provided in SEQ
ID NO: 2, 4 and 8-10.

The isolation and characterization of the breast-specific antigen B311D
has been described in U.S. Patent Application 09/289,198, filed April 9, 1999, the
disclosure of which is hereby incorporated by reference in its entirety. The determined
cDNA sequence for B311D is provided in SEQ ID NO: 11, with the corresponding
amino acid sequence being provided in SEQ ID NO:12.

cDNA sequences for mammaglobin are provided in Figs. 4 and 5, with
the cDNA sequence for GABAπ being provided in Fig. 6, and are disclosed in SEQ ID
NOs: 73-75, respectively.

The isolation and characterization of the breast-specific antigen lipophilin
B has been described in U.S. Patent Application 09/780,842, filed February 8, 2001, the
disclosure of which is hereby incorporated by reference in its entirety. The determined
cDNA sequence for lipophilin B is provided in SEQ ID NO:76, with the corresponding
amino acid sequence being provided in SEQ ID NO:77. The nucleotide sequences of
several sequence variants of lipophilin B are also described in the 09/780,842
application.



EXAMPLE 5 

MICROARRAY ANALYSIS

This example discloses the use of microarray analyses to identify
polynucleotides that are at least two-fold overexpressed in breast tumor tissue samples
as compared to normal breast tissue samples.

Analysis of mRNA expression of the polynucleotides of interest was performed as
follows. cDNA for the different genes was prepared as described above and arrayed on
a glass slide (Incyte, Palo Alto, CA). The arrayed cDNA was then hybridized with a
1:1 mixture of Cy3 and Cy5 fluorescent labeled first strand cDNAs obtained from
polyA+ RNA from breast tumors, normal breast and normal tissues and other tumors as
described in Shalon, D. et al., Genome Res. 6:639-45 (1996), incorporated herein by
reference in its entirety. Typically Cy3 (Probe 1) was attached to cDNAs from breast
tumors and Cy5 (Probe 2) to normal breast tissue or other normal tissues. Both probes
were allowed to compete for the immobilized gene specific cDNAs on the chip, which was
washed and then scanned for the fluorescence intensity of the individual Cy3 and Cy5
signals to determine the extent of hybridization. Data were analyzed using
GEMTOOLS software (Incyte, Palo Alto, CA), which enabled the overexpression
patterns of breast tumors to be compared with normal tissues by the ratios of Cy3/Cy5.
The fluorescence intensity was also related to the expression level of the individual
genes.
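
The Cy3/Cy5 ratio comparison can be illustrated with a short sketch. It is not a
description of the GEMTOOLS software: the function name, the clone names and the
intensity values are hypothetical, and the two-fold threshold is taken from the stated
aim of this example. Spots with no Cy5 signal are skipped because the ratio is
undefined.

```python
def overexpressed(spots, min_ratio=2.0):
    """Return {name: Cy3/Cy5 ratio} for spots at or above the ratio threshold.

    spots: dict mapping an arrayed cDNA name to (cy3_intensity, cy5_intensity),
    where Cy3 labels tumor cDNA and Cy5 labels normal tissue cDNA.
    """
    flagged = {}
    for name, (cy3, cy5) in spots.items():
        if cy5 <= 0:
            continue  # ratio undefined without a normal-tissue signal
        ratio = cy3 / cy5
        if ratio >= min_ratio:
            flagged[name] = ratio
    return flagged


# Hypothetical intensities (illustrative only).
example_spots = {
    "clone_A": (1200.0, 300.0),
    "clone_B": (500.0, 450.0),
    "clone_C": (100.0, 0.0),
}
```

With these hypothetical intensities only clone_A, at a ratio of 4, would be carried
forward for further analysis.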

DNA microarray analysis was used primarily as a screening tool to
determine tissue/tumor specificity of cDNAs recovered from the differential display,
cDNA library and PCR subtractions, prior to more rigorous analysis by quantitative
RT-PCR, northern blotting, and immunohistochemistry. Microarray analysis was
performed on two microchips. A total of 3603 subtracted cDNAs and 197 differential
display templates were evaluated to identify 40 candidates for further analysis by
quantitative PCR. From these candidates, several were chosen on the basis of favorable
tissue specificity profiles, including B305D, B311D, B726P, B511S and B533S,
indicating their overexpression profiles in breast tumors and/or normal breast versus
other normal tissues. It was evident that the expression of these genes showed a high
degree of specificity for breast tumors and/or breast tissue. In addition, these genes
have in many cases complementary expression profiles.

The two known breast-specific genes, mammaglobin and γ-aminobutyrate
type A receptor π subunit (GABAπ), were also subjected to microarray
analysis. mRNA expression of mammaglobin has been previously described to be
upregulated in proliferating breast tissue, including breast tumors. See Watson et al.,
Cancer Res. 56:860-5 (1996); Watson et al., Cancer Res. 59:3028-3031 (1999);
Watson et al., Oncogene 16:817-24 (1998), incorporated herein by reference in their
entirety. The GABAπ mRNA levels were over-expressed in breast tumors. Previous
studies had demonstrated its overexpression in uterus and to some degree in prostate
and lung (Hedblom et al., J. Biol. Chem. 272:15346-15350 (1997)) but no previous
study had indicated elevated levels in breast tumors.

EXAMPLE 6 

Quantitative Real-time PCR Analysis

This example discloses the use of quantitative Real-time PCR to confirm the
microarray identification of polynucleotides that are at least two-fold overexpressed in
breast tumor tissue samples as compared to normal breast tissue samples.

The tumor- and/or tissue-specificity of the polynucleotides identified by
the microarray analyses disclosed herein in Example 5 was confirmed by quantitative
PCR analyses. Breast metastases, breast tumors, benign breast disorders and normal
breast tissue along with other normal tissues and tumors were tested in quantitative
(Real-time) PCR. This was performed either on the ABI 7700 Prism or on a
GeneAmp® 5700 sequence detection system (PE Biosystems, Foster City, CA). The
7700 system uses a forward and a reverse primer in combination with a specific probe
designed to anneal to sequence between the forward and reverse primer. This probe
was conjugated at the 5' end with a fluorescent reporter dye and at the 3' end with a
quencher dye (Taqman™). During PCR the Taq DNA polymerase with its 5'-3'
nuclease activity cleaved the probe, which began to fluoresce, allowing the reaction to
be monitored by the increase in fluorescence (Real-time). Holland et al., Proc Natl
Acad Sci USA 88:7276-7280 (1991). The 5700 system used SYBR® green, a
fluorescent dye that only binds to double stranded DNA (Schneeberger et al., PCR
Methods Appl. 4:234-8 (1995)), and the same forward and reverse primers as the 7700
instrument. No probe was needed. Matching primers and fluorescent probes were
designed for each of the genes according to the Primer Express program (PE
Biosystems, Foster City, CA).



Table 1.

Primer and Probe Sequences for the Genes of Interest

Mammaglobin: Forward Primer (SEQ ID NO: [illegible in source]):
TGCCATAGATGAATTGAAGGAATG. Reverse Primer (SEQ ID NO: [illegible in source]):
TGTCATATATTAATTGCATAAACACCTCA. Probe (SEQ ID NO: [illegible in source]):
TCTTAACCAAACGGATGAAACTCTGAGCAATG.

B305D-C form: Forward Primer (SEQ ID NO: 39): AAAGCAGATGGTGGTTGAGGTT.
Reverse Primer (SEQ ID NO: 40): CCTGAGACCAAATGGCTTCTTC. Probe (SEQ ID NO: 41):
ATTCCATGCCGGCTGCTTCTTCTG.

B311D: Forward Primer (SEQ ID NO: 63): [illegible in source]ACACTAGAGATC.
Reverse Primer (SEQ ID NO: 64): [illegible in source]TATGTACCAAAAATGAAGT.
Probe (SEQ ID NO: 65): [illegible in source]GTATGGCCC.

B726P: Forward Primer (SEQ ID NO: 42): TCTGGTTTTCTCATTCTTTATTCATTTATT.
Reverse Primer (SEQ ID NO: 43): TGCCAAGGAGCGGATTATCT. Probe (SEQ ID NO: 44):
CAACCACGTGACAAACACTGGAATTACAGG.

B533S: Forward Primer (SEQ ID NO: 66): [illegible in source]CACACACTGT.
Reverse Primer (SEQ ID NO: 67): [illegible in source]ATGTGGAAGCT.
Probe (SEQ ID NO: [illegible in source]): [illegible in source]ATATACTATTCTACTGTCTG.

GABAπ: Forward Primer (SEQ ID NO: 36): AAGCCTCAGAGTCCTTCCAGTATG.
Reverse Primer (SEQ ID NO: 72): AAATATAAGTGAAGAAAAAAATTAGTAGAT.
Probe (SEQ ID NO: 38): AATCCATTGTATCTTAGAACCGAGGGATTTGTTTAGA.

B511S: Forward Primer (SEQ ID NO: 69): GACATTCCAGTTTTACCCAAATGG.
Reverse Primer (SEQ ID NO: 70): TGCAGAAGACTCAAGCTGATTCC.
Probe (SEQ ID NO: 71): TCTCAGGGACACACTCTACCATTCGGGA.



The concentrations used in the quantitative PCR for the forward primers
for mammaglobin, GABAπ, B305D C form, B311D, B511S, B533S and B726P were
900, 900, 300, 900, 900, 300 and 300 nM respectively. For the reverse primers they
were 300, 900, 900, 900, 300, 900 and 900 nM respectively. Primers and probes so
produced were used in the universal thermal cycling program in real-time PCR. They
were titrated to determine the optimal concentrations using a checkerboard approach. A
pool of cDNA from target tumors was used in this optimization process. The reaction
was performed in 25 µl volumes. The final probe concentration in all cases was 160 nM.
dATP, dCTP and dGTP were at 0.2 mM and dUTP at 0.4 mM. Amplitaq gold and
AmpErase UNG (PE Biosystems, Foster City, CA) were used at 0.625 units and 0.25
units per reaction. MgCl2 was at a final concentration of 5 mM. Trace amounts of
glycerol, gelatin and Tween 20 (Sigma Chem Co, St Louis, MO) were added to
stabilize the reaction. Each reaction contained 2 µl of diluted template. The cDNA from
RT reactions prepared as above was diluted 1:10 for the gene of interest and 1:100 for
β-actin. Primers and probes for β-actin (PE Biosystems, Foster City, CA) were used in
a similar manner to quantitate the presence of β-actin in the samples. In the case of the
SYBR® green assay, the reaction mix (25 µl) included 2.5 µl of SYBR green buffer, 2 µl
of cDNA template and 2.5 µl each of the forward and reverse primers for the gene of
interest. This mix also contained 3 mM MgCl2, 0.25 units of AmpErase UNG, 0.625
units of Amplitaq gold, 0.08% glycerol, 0.05% gelatin, 0.0001% Tween 20 and 1 mM
dNTP mix. In both formats, 40 cycles of amplification were performed.
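As a worked check on the concentrations quoted above, the following Python sketch converts a final concentration in a 25 µl reaction into the absolute amount of oligonucleotide per reaction. The function name is hypothetical; the 160 nM probe and 900 nM primer values and the 25 µl volume are the ones stated in the text.

```python
def pmol_per_reaction(conc_nM: float, volume_ul: float) -> float:
    """Convert a final concentration (nM) in a reaction volume (µl) to pmol.

    1 nM equals 1 fmol per µl, so nM x µl gives fmol; dividing by 1000 gives pmol.
    """
    return conc_nM * volume_ul / 1000.0


REACTION_VOLUME_UL = 25.0

probe_pmol = pmol_per_reaction(160.0, REACTION_VOLUME_UL)   # Taqman probe at 160 nM
primer_pmol = pmol_per_reaction(900.0, REACTION_VOLUME_UL)  # a primer used at 900 nM
print(probe_pmol, primer_pmol)  # prints 4.0 22.5
```

The 4 pmol of probe obtained here agrees with the 4 pmol of each Taqman probe per 25 µl reaction given for the multiplex protocol in Example 9.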

In order to quantitate the amount of specific cDNA (and hence initial
mRNA) in the sample, a standard curve was generated for each run using the plasmid
containing the gene of interest. Standard curves were generated using the Ct values
determined in the real-time PCR, which were related to the initial cDNA concentration
used in the assay. Standard dilutions ranging from 20-2x10^ copies of the gene of
interest were used for this purpose. In addition, a standard curve was generated for the
housekeeping gene β-actin ranging from 200 fg-2000 pg to enable normalization to a
constant amount of β-actin. This allowed the evaluation of the over-expression levels
seen with each of the genes.
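The standard-curve quantitation and β-actin normalization described above can be sketched as follows. This is a minimal illustration that assumes the usual linear relation between Ct and log10 of the starting quantity; the function names, the synthetic Ct values and the slopes used to generate them are hypothetical and are not taken from the source. Only the dilution ranges (20 copies upward for the gene, 200 fg to 2000 pg for β-actin) follow the text.

```python
import math


def fit_standard_curve(quantities, ct_values):
    """Least-squares fit of Ct = slope * log10(quantity) + intercept."""
    xs = [math.log10(q) for q in quantities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ct_values) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ct_values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def quantity_from_ct(ct, slope, intercept):
    """Invert the standard curve to recover the starting quantity for a sample Ct."""
    return 10 ** ((ct - intercept) / slope)


def copies_per_ng_actin(gene_ct, gene_curve, actin_ct, actin_curve):
    """Gene copies divided by the ng of beta-actin read off the actin curve."""
    return quantity_from_ct(gene_ct, *gene_curve) / quantity_from_ct(actin_ct, *actin_curve)


# Synthetic plasmid standards (copies) and beta-actin standards (ng, 200 fg to 2000 pg).
gene_standards = [20, 200, 2_000, 20_000]
gene_cts = [40.0 - 3.32 * math.log10(c) for c in gene_standards]
actin_standards_ng = [0.0002, 0.002, 0.02, 0.2, 2.0]
actin_cts = [25.0 - 3.30 * math.log10(a) for a in actin_standards_ng]

gene_curve = fit_standard_curve(gene_standards, gene_cts)
actin_curve = fit_standard_curve(actin_standards_ng, actin_cts)

# A sample whose Ct values correspond to 2000 copies and 0.2 ng of beta-actin.
normalized = copies_per_ng_actin(gene_cts[2], gene_curve, actin_cts[3], actin_curve)
print(round(normalized))
```

Fitting a fresh curve for each run, as the text specifies, keeps run-to-run differences in amplification efficiency from leaking into the normalized copy numbers.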

The genes B311D, B533S and B726P were evaluated in quantitative
PCR as described above on two different panels consisting of: (a) breast tumor, breast
normal and normal tissues; and (b) breast tumor metastases (primarily lymph nodes),
using the primers and probes shown above in Table 1. The data for panel (a) are shown in
Figure 1 for all three genes. The three genes showed identical breast tissue expression
profiles. However, the relative level of gene expression was very different in each case.
B311D in general was expressed at lower levels than B533S and both less than B726P,
but all three were restricted to breast tissue. The quantitative PCR thus confirmed there
was a differential expression between normal breast tissue and breast tumors for all
three genes, and that approximately 50% of breast tumors over-expressed these genes.
When tested on a panel of distant metastases derived from breast cancers all three genes
reacted with 14/21 metastases and presented similar profiles. All three genes also
exhibited increasing levels of expression as a function of pathological stage of the
tumor, as shown for B533S in Figure 2.
Mammaglobin is a homologue of a rabbit uteroglobin and the rat steroid
binding protein subunit C3 and is a low molecular weight protein that is highly
glycosylated. In contrast to its homologs, mammaglobin has been reported to be breast
specific and over-expression has been described in breast tumor biopsies (23%) and
primary and metastatic breast tumors (~75%) with reports of the detection of
mammaglobin mRNA expression in 91% of lymph nodes from metastatic breast cancer
patients. However, more rigorous analysis of mammaglobin gene expression by
microarray and quantitative PCR as described above (panels (a) and (b) and a panel of
other tumors and normal tissues and additional breast tumors), showed expression at
significant levels in skin and salivary gland with much lower levels in esophagus and
trachea, as shown in Table 2 below.
Table 2

Normalized Distribution of Mammaglobin and B511S mRNA in Various Tissues

Tissue | Mean Copies Mammaglobin/ng β-Actin ± SD | PCR Positive | Mean Copies B511S/ng β-Actin ± SD | PCR Positive | PCR Positive (Mammaglobin/B511S)
Breast tumors | 1233.88 ± 3612.74 | 31/42 | 1800.40 ± 3893.24 | 33/42 | 38/42
Breast tumor metastases | 1912.54 ± 4625.85 | 14/24 | 3329.50 ± 10820.71 | 14/24 | 17/24
Benign breast disorders | 121.87 ± 78.63 | 3/3 | 524.66 ± 609.43 | 2/3 | 3/3
Normal breast | 114.19 ± 94.40 | 11/11 | 517.64 ± 376.83 | 8/9 | 11/11
Breast reduction | 231.50 ± 276.68 | 2/3 | 482.54 ± 680.28 | | 2/3
Other tumors | 0.13 ± 0.65 | 1/39 | 24.17 ± 36.00 | 5/23 |
Salivary gland | 435.65 ± 705.11 | 2/3 | 45766.61 ± 44342.43 | 3/3 |
Skin | 415.74 ± 376.14 | 7/9 | 7039.05 ± 7774.24 | 9/9 |
Esophagus | 4.45 ± 3.86 | 2/3 | 1.02 ± 0.14 | 0/3 |
Bronchia | 0.16 | 0/1 | 84.44 ± 53.31 | 2/2 |
Other normal tissues | 0.33 ± 1.07 | 0/85 | 5.49 ± 10.65 | 3/75 |


The breast-specific gene B511S, while having a different profile of
reactivity on breast tumors and normal breast tissue to mammaglobin, reacted with the
same subset of normal tissues as mammaglobin. B511S by PSORT analysis is
indicated to have an ORF of 90 aa and to be a secreted protein, as is the case for
mammaglobin. B511S has no evidence of a transmembrane domain but may harbor a
cleavable signal sequence. Mammaglobin detected 14/24 of distant metastatic breast
tumors, 31/42 breast tumors and exhibited ten-fold over-expression in tumors and
metastases as compared to normal breast tissue. There was at least 300-fold over-expression
in normal breast tissue versus other negative normal tissues and tumors
tested, which were essentially negative for mammaglobin expression. B511S detected
33/42 breast tumors and 14/24 distant metastases, while a combination of B511S with
mammaglobin would be predicted to detect 38/42 breast tumors and 17/24 metastatic
lesions (Table 2 above). The quantitative levels of expression of B511S and
mammaglobin were also in similar ranges, in concordance with the microarray profiles
observed for these two genes. Other genes that were additive with mammaglobin are
shown in Table 3.



Table 3

mRNA Complementation of Mammaglobin with Other Genes

The first column gives mammaglobin-positive samples; the remaining columns give
detection by the named genes among the mammaglobin-negative samples.

Sample type | Mammaglobin Positive | B305D | GABAπ | B726P | B305D + GABAπ | B305D + GABAπ + B726P
Breast metastases | 13/21 | 2/8 | 5/8 | 3/8 | 7/8 | 8/8
Breast tumors | 18/25 | 3/7 | 4/7 | 5/7 | 7/7 | 7/7



B305D was shown to be highly over-expressed in breast tumors, prostate
tumors, normal prostate tissue and testis compared to normal tissues, including normal
breast tissue. Different splice variants of B305D have been identified, with forms A and
C being the most abundant, but all tested have similar tissue profiles in quantitative
PCR. The A and C forms contain ORFs of 320 and 385 aa, respectively. B305D is
predicted by PSORT to be a Type II membrane protein that comprises a series of
ankyrin repeats. A known gene shown to be complementary with B305D, in breast
tumors, was GABAπ. This gene is a member of the GABA-A receptor family and
encodes a protein that has 30-40% amino acid homology with other family members,
and has been shown by Northern blot analysis to be over-expressed in lung, thymus and
prostate at low levels and highly over-expressed in uterus. Its expression in breast tissue
has not been previously described. This is in contrast to other GABA-A receptors that
have appreciable expression in neuronal tissues. Tissue expression profiling of this
gene showed it to be over-expressed in breast tumors in an inverse relationship to the
B305D gene (Table 3). GABAπ detected 15/25 tumors and 6/21 metastases including 4
tumors and 5 metastases missed by mammaglobin. In contrast, B305D detected 13/25
breast tumors and 8/21 metastases, again including 3 tumors and 2 metastases missed
by mammaglobin. A combination of just B305D and GABAπ would be predicted
to identify 22/25 breast tumors and 14/21 metastases. The combination of B305D and
GABAπ with mammaglobin in detecting breast metastases is shown in Table 3 above
and Figures 3A and 3B. This combination detected 20/21 of the breast metastases as
well as 25/25 breast tumors that were evaluated on the same panels for all three genes.
The one breast metastasis that was negative for these three genes was strongly positive
for B726P (Figs. 3A and 3B).
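The complementation arithmetic behind these combined detection rates can be reproduced from the Table 3 counts. The sketch below is illustrative only: the function name is hypothetical, and it simply adds the mammaglobin-negative samples picked up by the additional genes to the mammaglobin-positive samples.

```python
def combined_detection(marker_positive: int, total: int, rescued: int):
    """Add samples rescued by further genes to those already detected by the first marker.

    Returns (detected, total, percent detected).
    """
    negatives = total - marker_positive
    if not 0 <= rescued <= negatives:
        raise ValueError("rescued samples must come from the marker-negative group")
    detected = marker_positive + rescued
    return detected, total, 100.0 * detected / total


# Counts from Table 3: mammaglobin-positive samples, plus mammaglobin-negative
# samples detected by B305D + GABA-pi (with or without B726P).
metastases_three_genes = combined_detection(13, 21, 7)
tumors_three_genes = combined_detection(18, 25, 7)
metastases_four_genes = combined_detection(13, 21, 8)

print(metastases_three_genes, tumors_three_genes, metastases_four_genes)
```

The three-gene combination gives 20/21 metastases and 25/25 tumors, and adding B726P brings the metastases to 21/21, matching the figures stated in the text.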

To evaluate the presence of circulating tumor cells, an immunocapture
(cell capture) method was employed to first enrich for epithelial cells prior to RT-PCR
analysis. Immunomagnetic polystyrene beads coated with specific monoclonal
antibodies to two glycoproteins on the surface of epithelial cells were used for this
purpose. Such an enrichment procedure increased the sensitivity of detection (~100
fold) as compared to direct isolation of poly A+ RNA, as shown in Table 4.



Table 4

Extraction of Mammaglobin Positive Cells (MB415) Spiked into Whole Blood and
Detection by Real-time PCR

Values are copies mammaglobin/ng β-actin.

MB415 cells/ml Blood | Epithelial cell extraction (Poly A+ RNA) | Direct Extraction (Poly A+ RNA)
100000 | 54303.2 | 58527.1
10000 | 45761.9 | 925.9
1000 | 15421.2 | 61.6
100 | 368.0 | 5.1
10 | 282.0 | 1.1
1 | 110.2 | 0
0 | 0 | 0
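The gain from enrichment at each spiking level can be computed directly from the Table 4 values. The following Python sketch is illustrative: the dictionary and function names are hypothetical, and the ratio is left undefined (None) wherever direct extraction gave no signal.

```python
# Copies mammaglobin/ng beta-actin from Table 4: (epithelial enrichment, direct extraction),
# keyed by MB415 cells spiked per ml of blood.
TABLE_4 = {
    100000: (54303.2, 58527.1),
    10000: (45761.9, 925.9),
    1000: (15421.2, 61.6),
    100: (368.0, 5.1),
    10: (282.0, 1.1),
    1: (110.2, 0.0),
    0: (0.0, 0.0),
}


def fold_enrichment(enriched: float, direct: float):
    """Ratio of enriched to direct signal, or None when direct extraction gave no signal."""
    if direct == 0:
        return None
    return enriched / direct


fold = {cells: fold_enrichment(e, d) for cells, (e, d) in TABLE_4.items()}
for cells, ratio in fold.items():
    print(cells, "undefined" if ratio is None else round(ratio, 1))
```

At the highest spiking level the two methods give similar signals (ratio below 1), whereas from 10000 down to 10 cells/ml the ratio ranges from roughly 50 to roughly 250. At 1 cell/ml only the enriched sample gave a signal at all.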



Mammaglobin-positive cells (MB415) were spiked into whole blood at
various concentrations and then extracted using either epithelial cell enrichment or
direct isolation from blood. Using enrichment procedures, mammaglobin mRNA was
found to be detectable at much lower levels than when direct isolation was used. Whole
blood samples from patients with metastatic breast cancer were subsequently treated
with the immunomagnetic beads. Poly A+ RNA was then isolated, cDNA prepared and
run in quantitative PCR using two gene specific primers (Table 1) and a fluorescent
probe (Taqman™). As observed in breast cancer tissues, complementation was also
seen in the detection of circulating tumor cells derived from breast cancers. Again,
mammaglobin PCR detected circulating tumor cells in a high percentage of blood
samples, albeit at low levels, from metastatic breast cancer patients (20/32) when
compared to the normal blood samples (Table 5), but several of the other genes tested to
date further increased this detection rate. These included B726P, B305D, B311D, B533S
and GABAπ. The detection level of mammaglobin in blood samples from metastatic
breast cancer patients is higher than described previously (62 vs. 49%), despite testing
smaller blood volumes, probably because of the use of epithelial marker-specific
enrichment in our study. A combination of all the genes tested indicates that 27/32
samples were positive by one or more of these genes.
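Table 5

Gene Complementation in Epithelial Cells Isolated from Blood of Normal Individuals
and Metastatic Breast Cancer Patients

Sample ID | Mammaglobin | B305D | B311D | B533S | B726P | GABAπ | Combo
2 | + | - | - | | - | - | +
3 | | - | - | + | - | - | +
5 | + | | | - | + | - | +
6 | + | - | + | + | + | - | +
8 | - | - | | - | - | - | +
9 | + | + | + | - | + | - | +
10 | + | - | + | - | + | - | +
11 | - | - | | - | - | - | -
12 | + | + | + | - | - | - | +
13 | - | - | - | | - | - | +
15 | - | - | - | - | - | - | -
18 | + | - | - | - | - | - | +
19 | + | - | - | - | - | + | +
21 | + | - | - | - | - | - |
22 | - | - | - | - | - | - | -
23 | + | - | - | - | - | - | +
24 | | - | - | - | - | - | +
25 | - | + | - | - | - | - | +
26 | - | - | - | - | - | - | -
29 | + | - | + | + | + | - | +
31 | + | - | | + | - | - | +
32 | - | - | - | - | - | ± | ±
33 | - | - | - | - | + | - | +
34 | + | - | - | | - | + | +
35 | + | - | - | - | + | - | +
36 | - | - | - | - | - | | +
37 | + | - | - | | - | - | +
38 | - | - | - | - | - | - | -
40 | + | - | - | - | - | - | +
41 | + | - | - | + | - | - | +
42 | + | - | - | - | - | - | +
43 | - | - | - | - | - | + | +
Donor 104 | | | | | | + | +
Donor 348 | | | | | | Nd |
Donor 392 | | | | | | Nd |
Donor 408 | | | | | | Nd |
Donor 244 | | | | | | |
Donor 355 | | | | | | |
Donor 264 | | | | | | |
Donor 232 | | | | | | Nd |
Donor 12 | | | | | | |
Donor 415 | | | | | | Nd |
Donor 35 | | | | | | |
Sensitivity | 20/32 | 4/32 | 7/32 | 9/32 | 7/32 | 4/32 | 27/32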



In further studies, mammaglobin, GABAπ, B305D (C form) and B726P
specific primers and specific Taqman probes were employed in different combinations
to analyze their combined mRNA expression profile in breast metastases (B. met) and
breast tumor (B. tumors) samples using real-time PCR. The forward and reverse
primers and probes employed for mammaglobin, B305D (C form) and B726P are
shown in Table 1. The forward primer and probe employed for GABAπ are shown in
Table 1, with the reverse primer being as follows:
TTCAAATATAAGTGAAGAAAAAATTAGTAGATCAA (SEQ ID NO: 51). As
shown below in Table 6, a combination of mammaglobin, GABAπ, B305D (C form)
and B726P was found to detect 22/22 breast tumor samples, with an increase in
expression being seen in 5 samples (indicated by ++).
Table 6

Real-time PCR Detection of Tumor Samples using Different Primer Combinations

Tumor sample | Mammaglobin | Mammaglobin + GABA | Mammaglobin + GABA + B305D | Mammaglobin + GABA + B305D + B726P
B. Met 316A | | + | | +
B. Met 317A | + | + | + | +
B. Met 318A | | + | | ++
B. Met 595A | + | + | + | +
B. Met 611A | + | + | + | +
B. Met 612A | + | | + |
B. Met 614A | | + | + | +
B. Met 616A | | + | + | +
B. Met 618A | + | + | + | +
B. Met 620A | + | | + | +
B. Met 621A | + | + | + | +
B. Met 624A | + | | + | +
B. Met 625A | | | + |
B. Met 627A | + | | + | +
B. Met 629A | | + | + | +
B. Met 631A | + | + | + | +
B. Tumor 154A | | + | + | ++
B. Tumor 155A | + | + | + | ++
B. Tumor 81D | | | + | ++
B. Tumor 209A | | + | + | +
B. Tumor 208A | | + | + | ++
B. Tumor 10A | + | + | + |


The increase of message signals by the addition of specific primers was
further demonstrated in a one plate experiment employing the four tumor samples B.
met 316A, B. met 317A, B. tumor 81D and B. tumor 209A.

Expression of a combination of mammaglobin, GABAπ, B305D (C
form) and B726P in a panel of breast tumor and normal tissue samples was also
detected using real-time PCR with a SYBR Green detection system instead of the
Taqman probe approach. The results obtained using this system are shown in Figure 7.

EXAMPLE 7 

Quantitative PCR in Peripheral Blood of Breast Cancer Patients

The known genes evaluated in this study were mammaglobin and γ-aminobutyrate
type A receptor π subunit (GABAπ). In order to identify novel genes
which are over-expressed in breast cancer we have used an improved version of the
differential display RT-PCR (DDPCR) technique (Liang et al., Science 257:967-971
(1993); Mou et al., Biochem Biophy Res Commun. 199:564-569 (1994)); cDNA library
extraction methods (Kara et al., Blood 84:189-199 (1994)) and PCR subtraction
(Diatchenko et al., Proc Natl Acad Sci USA 93:6025-6030 (1996); Yang et al.,
Nucleic Acids Res. 27:1517-23 (1999)).

Differential display resulted in the recovery of two cDNA fragments
designated as B305D and B311D (Houghton et al., Cancer Res. 40: Abstract #217, 32-33
(1999)). B511S and B533S are two cDNA fragments isolated using a cDNA library
subtraction approach (manuscript in preparation), while the B726P cDNA fragment was
derived from PCR subtraction (Jiang et al., Proceedings of the Amer Assoc Cancer Res.
40: Abstract #216, 32 (1999); Xu et al., Proceedings of the Amer Assoc Cancer Res.
40: Abstract #2115, 319 (1999); and Molesh et al., Proceedings of Amer Assoc Cancer
Res. 41: Abstract #4330, 681 (2000)).

Three of the novel genes, B311D, B533S and B726P, showed identical
breast tissue expression profiles by quantitative PCR analysis. These genes were
evaluated in quantitative PCR on two different panels consisting of (a) breast tumor,
breast normal and normal tissues and (b) a panel of breast tumor metastases (primarily
lymph nodes). Primers and probes used are shown in Table 1. The data for panel (a) are
shown in Figure 2 for all three genes. Overall, the expression profiles are comparable
and are in the same rank order; however, the levels of expression are considerably
different. B311D in general was expressed at lower levels than B533S and both less
than B726P, but all three were restricted to breast tissue. All three sequences were used
to search against the Genbank database. Both B311D and B533S sequences contain
different repetitive sequences and an ORF has not been identified for either. B726P is a
novel gene, with mRNA splicing yielding several different putative ORFs.

The quantitative PCR confirmed there was a differential mRNA expression
between normal breast tissue and breast tumors, with approximately 50% of breast
tumors overexpressing these genes. When tested on a panel of distant metastases derived
from breast cancers all three genes reacted with 14/21 metastases and presented similar
profiles (data not shown). Interestingly, when tested on a prostate cancer panel, all
three genes identified the same 3/24 prostate tumors but at much lower expression
levels than in breast. This group of genes exhibited increasing levels of expression as a
function of pathological stage of the tumor, as shown for B533S.

More rigorous analysis of mammaglobin gene expression by microarray
and quantitative PCR showed expression at significant levels in skin and salivary gland
and much lower levels in esophagus and trachea. B511S had a slightly different profile
of reactivity on breast tumors and normal breast tissue when compared to
mammaglobin, yet reacted with a similar subset of normal tissues as mammaglobin.
Mammaglobin detected 14/24 of distant metastatic breast tumors, 31/42 breast tumors
and exhibited ten-fold over-expression in tumors and metastases as compared to normal
breast tissue. There was at least 300-fold over-expression of mammaglobin in normal
breast tissue versus other negative normal tissues and tumors tested. B511S detected
33/42 breast tumors and 14/24 distant metastases. A combination of B511S with
mammaglobin would be predicted to detect 38/42 breast tumors and 17/24 metastatic
lesions. The quantitative levels of expression of B511S and mammaglobin were also in
similar ranges, in concordance with the microarray profiles observed for these two
genes.

Certain genes complemented mammaglobin's expression profile, i.e. were
shown to express in tumors that mammaglobin did not. B305D was highly over-expressed
in breast tumors, prostate tumors, normal prostate tissue and testis compared
to normal tissues including normal breast tissue. Different splice variants of B305D
were identified with the forms A and C being the most abundant. All forms tested had
similar tissue profiles in quantitative PCR. The A and C forms contain ORFs of 320
and 385 aa, respectively. A known gene shown to be complementary with B305D, in
breast tumors, was GABAπ. This tissue expression profile is in contrast to other
GABA-A receptors that typically have appreciable expression in neuronal tissues. An
additional observation was that tissue expression profiling of this gene showed it to be
over-expressed in breast tumors in an inverse relationship to the B305D gene (Table 3).
GABAπ detected 15/25 tumors and 6/21 metastases including 4 tumors and 5
metastases missed by mammaglobin. In contrast, B305D detected 13/25 breast tumors
and 8/21 metastases, again including 3 tumors and 2 metastases missed by
mammaglobin. A combination of just B305D and GABAπ would be predicted to
identify 22/25 breast tumors and 14/21 metastases. The combination of B305D and
GABAπ with mammaglobin detected 20/21 of
the breast metastases as well as 25/25 breast tumors that were evaluated on the same
panels for all three genes. The one breast metastasis that was negative for these three
genes was strongly positive for B726P.
The use of microarray analysis followed by quantitative PCR provided a
methodology to accurately determine the expression of breast cancer genes both in
breast tissues (tumor and normal) and in other normal tissues, and to assess their
diagnostic and therapeutic potential. Five novel genes and two known genes were
evaluated using these techniques. Three of these genes, B311D, B533S and B726P,
exhibited concordant mRNA expression, and collectively the data are consistent with
coordinated expression of these three loci at the level of transcription control. All three
genes showed differential expression in breast tumors versus normal breast tissue and
the level of overexpression appeared related to the pathological stage of the tumor. In
the case of mammaglobin, expression was found in other tissues apart from breast
tissue. Expression was seen in skin, salivary gland and to a much lesser degree in
trachea.

Expression of GABAπ in breast tumors was also a novel observation.
While the expression of several genes complemented that seen with mammaglobin, two
genes in particular, B305D and GABAπ, added to the diagnostic sensitivity of
mammaglobin detection. A combination of these three genes detected 45/46 (97.8%)
breast tumors and metastases evaluated. Inclusion of B726P enabled the detection of all
25 of the breast tumors and 21 distant metastases.



EXAMPLE 8

Enrichment of Circulating Breast Cancer Cells by Immunocapture

This example discloses the enhanced sensitivity achieved by use of the
immunocapture (cell capture) methodology for enrichment of circulating breast cancer
cells.

To evaluate the presence of circulating tumor cells an immunocapture
method was adopted to first enrich for epithelial cells prior to RT-PCR analysis.
Epithelial cells were enriched from blood samples with an immunomagnetic bead
separation method (Dynal A.S, Oslo, Norway) utilizing magnetic beads coated with
monoclonal antibodies specific for glycopolypeptide antigens on the surface of human
epithelial cells. (Exemplary suitable cell-surface antigens are described, for example, in
Momburg, F. et al., Cancer Res. 41:2883-91 (1997); Naume, B. et al., Journal of
Hemotherapy 6:103-113 (1997); Naume, B. et al., Int J Cancer 78:556-60 (1998);
Martin, V.M. et al., Exp Hematol. 26:252-64 (1998); Hildebrandt, M. et al., Exp
Hematol. 25:57-65 (1997); Eaton, M.C. et al., Biotechniques 22:100-5 (1997); Brandt,
B. et al., Clin Exp Metastases 14:399-408 (1996), each of which is incorporated herein
by reference in its entirety.) Cells isolated this way were lysed and the magnetic beads
removed. The lysate was then processed for poly A+ mRNA isolation using magnetic
beads (Dynabeads) coated with Oligo(dT)25. After washing the beads in the kit buffer,
bead/poly A+ RNA samples were finally suspended in 10 mM Tris HCl pH 8 and
subjected to reverse transcription. The RNA was then subjected to Real-time PCR
using gene specific primers and probes with reaction conditions as outlined herein
above. β-Actin content was also determined and used for normalization. Samples with
gene of interest copies/ng β-actin greater than the mean of the normal samples + 3
standard deviations were considered positive. Real-time PCR on blood samples was
performed exclusively using the Taqman™ procedure but extending to 50 cycles.
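The positivity rule stated above (mean of the normal samples plus 3 standard deviations) can be sketched as follows. This is a minimal illustration under stated assumptions: the function names and the example values are hypothetical, and the sample standard deviation is used because the source does not say whether a sample or population standard deviation was applied.

```python
from statistics import mean, stdev


def positivity_cutoff(normal_values, n_sd: float = 3.0) -> float:
    """Cutoff = mean of the normal-donor values plus n_sd sample standard deviations."""
    return mean(normal_values) + n_sd * stdev(normal_values)


def is_positive(sample_value: float, normal_values, n_sd: float = 3.0) -> bool:
    """A sample is positive only when it exceeds the cutoff (strictly greater than)."""
    return sample_value > positivity_cutoff(normal_values, n_sd)


# Hypothetical normalized values (gene copies/ng beta-actin) from normal blood donors.
normal_donors = [1.0, 2.0, 3.0]

cutoff = positivity_cutoff(normal_donors)
print(cutoff, is_positive(5.1, normal_donors), is_positive(5.0, normal_donors))
```

With these made-up donor values the mean is 2.0 and the sample standard deviation is 1.0, so the cutoff is 5.0; a patient value of 5.1 is called positive and a value of exactly 5.0 is not.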

Using enrichment procedures, mammaglobin mRNA was found to be
detectable at much lower levels than when direct isolation was used. Whole blood
samples from patients with metastatic breast cancer were subsequently treated with the
immunomagnetic beads, polyA+ RNA was then isolated, and cDNA was made and run in
quantitative PCR using two gene specific primers to mammaglobin and a fluorescent
probe (Taqman™). As observed in breast cancer tissues, complementation was also
seen in the detection of circulating tumor cells derived from breast cancers. Again,
mammaglobin PCR detected circulating tumor cells in a high percentage of blood samples,
albeit at low levels, from metastatic breast cancer patients (20/32) when compared to the
normal blood samples. Several of the other genes tested to date could further increase this
detection rate; these include B726P, B305D, B311D, B533S and GABAπ. A
combination of all the genes tested indicates that 27/32 samples were positive by one or
more of these genes.
EXAMPLE 9

Multiplex Detection of Breast Tumors

Additional Multiplex Real-time PCR assays were established in order to
simultaneously detect the expression of four breast cancer-specific genes: LipophilinB,
Gaba (B899P), B305D-C and B726P. In contrast to detection approaches relying on
expression analysis of single breast cancer-specific genes, this Multiplex assay was able
to detect all breast tumor samples tested.

This Multiplex assay was designed to detect LipophilinB expression
instead of Mammaglobin. Due to their similar expression profiles, LipophilinB can
replace Mammaglobin in this Multiplex PCR assay for breast cancer detection. The
assay was carried out as follows: LipophilinB, B899P (Gaba), B305D, and B726P
specific primers, and specific Taqman probes, were used to analyze their combined
mRNA expression profile in breast tumors. The primers and probes are shown below:

LipophilinB: Forward Primer (SEQ ID NO: 33): 5'
TGCCCCTCCGGAAGCT. Reverse Primer (SEQ ID NO: 34): 5'
CGTTTCTGAAGGGACATCTGATC. Probe (SEQ ID NO: 35) (FAM-5' - 3'-TAMRA):
TTGCAGCCAAGTTAGGAGTGAAGAGATGCA.

GABA (B899P): Forward Primer (SEQ ID NO: 36): 5'
AAGCCTCAGAGTCCTTCCAGTATG. Reverse Primer (SEQ ID NO: 37): 5'
TTCAAATATAAGTGAAGAAAAAATTAGTAGATCAA. Probe (SEQ ID NO: 38)
(FAM-5' - 3'-TAMRA): AATCCATTGTATCTTAGAACCGAGGGATTTGTTTAGA.

B305D (C form): Forward Primer (SEQ ID NO: 39): 5'
AAAGCAGATGGTGGTTGAGGTT. Reverse Primer (SEQ ID NO: 40): 5'
CCTGAGACCAAATGGCTTCTTC. Probe (SEQ ID NO: 41) (FAM-5' - 3'-TAMRA):
ATTCCATGCCGGCTGCTTCTTCTG.

B726P: Forward Primer (SEQ ID NO: 42): 5'
TCTGGTTTTCTCATTCTTTATTCATTTATT. Reverse Primer (SEQ ID NO: 43): 5'
TGCCAAGGAGCGGATTATCT. Probe (SEQ ID NO: 44) (FAM-5' - 3'-TAMRA):
CAACCACGTGACAAACACTGGAATTACAGG.

Actin: Forward Primer (SEQ ID NO: 45): 5'
ACTGGAACGGTGAAGGTGACA. Reverse Primer (SEQ ID NO: 46): 5'
CGGCCACATTGTGAACTTTG. Probe (SEQ ID NO: 47) (FAM-5' - 3'-TAMRA):
CAGTCGGTTGGAGCGAGCATCCC.
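
As an aside, basic properties of the oligonucleotides listed above, such as length and GC content, can be tabulated programmatically. The sketch below is illustrative only: the helper `gc_content` and the dictionary name `PRIMERS` are hypothetical, only a subset of the listed primers is included, and nothing here is stated in the source to have been part of the primer design procedure.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence (hypothetical helper)."""
    seq = seq.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("expected a non-empty DNA sequence of A/C/G/T")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Primer sequences copied from the listing above (5' to 3').
PRIMERS = {
    "LipophilinB-F": "TGCCCCTCCGGAAGCT",
    "LipophilinB-R": "CGTTTCTGAAGGGACATCTGATC",
    "B305D-F": "AAAGCAGATGGTGGTTGAGGTT",
    "B305D-R": "CCTGAGACCAAATGGCTTCTTC",
    "Actin-F": "ACTGGAACGGTGAAGGTGACA",
    "Actin-R": "CGGCCACATTGTGAACTTTG",
}

for name, seq in PRIMERS.items():
    print(f"{name}: {len(seq)} nt, GC {gc_content(seq):.0%}")
```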

The assay conditions were: 

Taqman protocol (7700 Perkin Elmer) : 

In 25 µl final volume: 1x Buffer A, 5 mM MgCl2, 0.2 mM dCTP, 0.2 mM
dATP, 0.4 mM dUTP, 0.2 mM dGTP, 0.01 U/µl AmpErase UNG, 0.025 U/µl TaqGold,
8% (v/v) Glycerol, 0.05% (v/v) Gelatin, 0.01% (v/v) Tween20, 4 pmol of each gene
specific Taqman probe (LipophilinB + Gaba + B305D + B726P), 100 nM of B726P-F +
B726P-R, 300 nM of Gaba-R, and 50 nM of LipophilinB-F + LipophilinB-R + B305D-R
+ Gaba-R, template cDNA (originating from 0.02 µg polyA+ RNA).
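
The protocol mixes two kinds of units: primers are given as concentrations (nM) while probes are given as amounts (pmol) per 25 µl reaction. The following sketch, with hypothetical function names, converts between the two for the stated final volume; it assumes nothing beyond the identity 1 nM = 1 fmol/µl.

```python
REACTION_VOLUME_UL = 25.0  # final reaction volume stated in the protocol


def nanomolar_to_pmol(conc_nm: float, volume_ul: float = REACTION_VOLUME_UL) -> float:
    """Amount (pmol) of an oligonucleotide present at conc_nm in volume_ul."""
    # 1 nM equals 1 fmol/ul, so nM * ul gives fmol; 1000 fmol = 1 pmol.
    return conc_nm * volume_ul / 1000.0


def pmol_to_nanomolar(amount_pmol: float, volume_ul: float = REACTION_VOLUME_UL) -> float:
    """Concentration (nM) of amount_pmol dissolved in volume_ul."""
    return amount_pmol * 1000.0 / volume_ul


for conc in (300, 100, 50):
    print(f"{conc} nM primer in 25 ul = {nanomolar_to_pmol(conc)} pmol")
print(f"4 pmol probe in 25 ul = {pmol_to_nanomolar(4)} nM")
```

Under these conversions, 300 nM, 100 nM and 50 nM correspond to 7.5, 2.5 and 1.25 pmol of primer per reaction, and 4 pmol of probe corresponds to 160 nM.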

LipophilinB expression was detected in 14 out of 27 breast tumor
samples. However, the Multiplex assay for LipophilinB, B899P, B305D-C and B726P detected
an expression signal in 27 out of 27 tumors with the detection level above 10 mRNA
copies/1000 pg actin in the majority of samples and above 100 mRNA copies/1000 pg
actin in 5 out of the 27 samples tested (Figure 8).
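
The expression levels above are reported as mRNA copies per 1000 pg actin. A minimal sketch of that normalization follows; the function names and the input values (50 copies, 2500 pg actin) are hypothetical and chosen only for illustration, while the thresholds of 10 and 100 copies are the ones quoted in the text.

```python
def copies_per_1000_pg_actin(target_copies: float, actin_pg: float) -> float:
    """Normalize a target copy number to 1000 pg of actin (hypothetical helper)."""
    if actin_pg <= 0:
        raise ValueError("actin amount must be positive")
    return target_copies * 1000.0 / actin_pg


def expression_bin(normalized: float) -> str:
    """Place a normalized value relative to the 10 and 100 copy levels in the text."""
    if normalized > 100:
        return "above 100"
    if normalized > 10:
        return "above 10"
    return "10 or below"


# Hypothetical measurement: 50 target copies detected alongside 2500 pg actin.
value = copies_per_1000_pg_actin(50, 2500)
print(value, expression_bin(value))
```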

EXAMPLE 10 
Multiplex Detection Optimization 

The Multiplex Real-time PCR assay described above was used to detect
the expression of Mammaglobin (or LipophilinB), Gaba (B899P), B305D-C and B726P
simultaneously. According to this Example, assay conditions and primer sequences
were optimized to achieve parallel amplification of four PCR products with different
lengths. Positive samples of this assay can be further characterized by gel
electrophoresis and the expressed gene(s) of interest can be determined according to the
detected amplicon size(s).

Mammaglobin (or LipophilinB), Gaba (B899P), B305D and B726P 
specific primers and specific Taqman probes were used to simultaneously detect their 
expression. The primers and probes used in this example are shown below. 

Mammaglobin: Forward Primer (SEQ ID NO: 48): 5'
TGCCATAGATGAATTGAAGGAATG. Reverse Primer (SEQ ID NO: 49): 5'
TGTCATATATTAATTGCATAAACACCTCA. Probe (SEQ ID NO: 50) (FAM-5' - 3'-TAMRA):
TCTTAACCAAACGGATGAAACTCTGAGCAATG.

GABA (B899P): Forward Primer (SEQ ID NO: 36): 5'
AAGCCTCAGAGTCCTTCCAGTATG. Reverse Primer (SEQ ID NO: 51): 5'
ATCATTGAAAATTCAAATATAAGTGAAG. Probe (SEQ ID NO: 38) (FAM-5' - 3'-TAMRA):
AATCCATTGTATCTTAGAACCGAGGGATTTGTTTAGA.

B305D (C form): Forward Primer (SEQ ID NO: 39): 5'
AAAGCAGATGGTGGTTGAGGTT. Reverse Primer (SEQ ID NO: 40): 5'
CCTGAGACCAAATGGCTTCTTC. Probe (SEQ ID NO: 41) (FAM-5' - 3'-TAMRA):
ATTCCATGCCGGCTGCTTCTTCTG.

B726P: Forward Primer (SEQ ID NO: 52): 5'
GTAGTTGTGCATTGAAATAATTATCATTAT. Reverse Primer (SEQ ID NO: 43): 5'
TGCCAAGGAGCGGATTATCT. Probe (SEQ ID NO: 44) (FAM-5' - 3'-TAMRA):
CAACCACGTGACAAACACTGGAATTACAGG.

Primer locations and assay conditions were optimized to achieve parallel
amplification of four PCR products with different sizes. The assay conditions were:

Taqman protocol (7700 Perkin Elmer):

In 25 µl final volume: 1x Buffer A, 5 mM MgCl2, 0.2 mM dCTP, 0.2
mM dATP, 0.4 mM dUTP, 0.2 mM dGTP, 0.01 U/µl AmpErase UNG, 0.0375 U/µl
TaqGold, 8% (v/v) Glycerol, 0.05% (v/v) Gelatin, 0.01% (v/v) Tween20, 4 pmol of
each gene specific Taqman probe (Mammaglobin + Gaba + B305D + B726P), 300 nM
of Gaba-R + Gaba-F, 100 nM of Mammaglobin-F + R; B726P-F + R, and 50 nM of
B305D-F + R, template cDNA (originating from 0.02 µg polyA+ RNA).

PCR protocol:

50° for 2': x 1, 95° for 10': x 1, and 95° for 15" / 60° for 1' / 68° for 1':
x 50.
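
The PCR protocol above can be written down as a small data structure, which also makes the programmed hold time easy to total. This is a sketch only: the class names `Step` and `Stage` are hypothetical, and instrument ramp times between temperatures are not included because the source does not state them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int    # hold temperature in degrees Celsius
    seconds: int   # hold duration in seconds


@dataclass(frozen=True)
class Stage:
    steps: tuple   # one or more Step objects run in order
    cycles: int    # number of times the steps are repeated

    def hold_seconds(self) -> int:
        return self.cycles * sum(step.seconds for step in self.steps)


# 50 deg for 2 min x1, 95 deg for 10 min x1, then 95/60/68 deg x50.
PROGRAM = (
    Stage((Step(50, 120),), 1),
    Stage((Step(95, 600),), 1),
    Stage((Step(95, 15), Step(60, 60), Step(68, 60)), 50),
)

total_hold_seconds = sum(stage.hold_seconds() for stage in PROGRAM)
print(f"programmed hold time: {total_hold_seconds / 60:.1f} minutes")
```

Excluding ramping, the programmed holds add up to 7470 seconds, or about two hours per run.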

Since each primer set in the multiplex assay results in a band of unique
length, expression signals of the four genes of interest can be measured individually by
agarose gel analysis (see Figure 9), or the combined expression signal of all four genes
can be measured in real-time on an ABI 7700 Prism sequence detection system (PE
Biosystems, Foster City, CA). The expression of LipophilinB can also be detected
instead of Mammaglobin. Although specific primers have been described herein,
different primer sequences, different primer or probe labeling and different detection
systems could be used to perform this multiplex assay. For example, a second
fluorogenic reporter dye could be incorporated for parallel detection of a reference gene
by real-time PCR. Or, for example, a SYBR Green detection system could be used
instead of the Taqman probe approach.
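
Because each primer set yields a band of unique length, bands read from a gel can be mapped back to the expressed genes. The sketch below illustrates the idea only. The amplicon lengths in `EXPECTED_BP` are hypothetical placeholders, since the source does not state the actual product sizes, and the function name and tolerance are likewise assumptions.

```python
# Hypothetical amplicon lengths in base pairs; the source does not give them.
EXPECTED_BP = {
    "Mammaglobin": 100,
    "B899P": 150,
    "B305D": 200,
    "B726P": 250,
}


def genes_for_bands(observed_bp, tolerance_bp=10):
    """Return the genes whose expected amplicon size matches an observed band."""
    called = []
    for gene, expected in EXPECTED_BP.items():
        if any(abs(band - expected) <= tolerance_bp for band in observed_bp):
            called.append(gene)
    return called


# Two bands read from a gel lane (illustrative values).
print(genes_for_bands([102, 248]))
```

The tolerance has to stay well below the smallest spacing between expected sizes, otherwise one band could be assigned to two genes.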

EXAMPLE 11

Design and Use of Genomic DNA-Excluding, Intron-Exon Border Spanning
Primer Pairs for Breast Cancer Multiplex Assay

The Multiplex Real-time PCR assay described herein can detect the
expression of Mammaglobin, Gaba (B899P), B305D-C and B726P simultaneously.
The combined expression levels of these genes are measured in real-time on an ABI 7700
Prism sequence detection system (PE Biosystems, Foster City, CA). Individually
expressed genes can also be identified due to different amplicon sizes via gel
electrophoresis. In order to use this assay with samples derived from non-DNase
treated RNAs (e.g. lymph node cDNA) and to avoid DNase treatment for small RNA
samples (e.g. from blood specimens, tumor and lymph node aspirates), intron-spanning
primer pairs have been designed to exclude the amplification of genomic DNA and
therefore to eliminate nonspecific and false positive signals. False positive signals are
caused by genomic DNA contamination in cDNA specimens. The optimized Multiplex
assay described herein excludes the amplification of genomic DNA and allows specific
detection of target gene expression without the necessity of prior DNase treatment of
RNA samples. Moreover, the genomic match and the location of the Intron-Exon border
could be verified with these primer sets.
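
The principle behind intron-exon border spanning primers can be shown with a toy example. All sequences below are invented for illustration and do not correspond to any gene in this application; annealing is also simplified to an exact substring match on one strand. The point is that a primer straddling an exon-exon junction finds its full target only in spliced cDNA, where the intron has been removed.

```python
# Toy sequences, invented for illustration only.
EXON1 = "ATGGCCAAGCTGACCTCA"
INTRON = "GTAAGTTTTCCCTTTCAG"
EXON2 = "GGATCCGTTCACTGGAAC"

GENOMIC = EXON1 + INTRON + EXON2   # intron still present
CDNA = EXON1 + EXON2               # spliced transcript, intron removed

# Primer spanning the exon1/exon2 junction: 8 nt from each side.
JUNCTION_PRIMER = EXON1[-8:] + EXON2[:8]
# Primer lying entirely within exon 1, for comparison.
EXONIC_PRIMER = EXON1[:16]


def anneals(primer: str, template: str) -> bool:
    """Simplified annealing test: exact match of the primer in the template."""
    return primer in template


for name, primer in (("junction", JUNCTION_PRIMER), ("exonic", EXONIC_PRIMER)):
    print(f"{name}: cDNA={anneals(primer, CDNA)}, genomic={anneals(primer, GENOMIC)}")
```

In this toy case the junction primer matches only the cDNA, whereas the purely exonic primer matches both templates and so could give signal from contaminating genomic DNA.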

Mammaglobin, Gaba (B899P), B305D and B726P specific primers and
specific Taqman probes were used to simultaneously detect their expression (Table 7).
Primer locations were optimized (Intron-Exon border spanning) to exclusively detect
cDNA and to exclude genomic DNA from amplification. The identity of the expressed
gene(s) was determined by gel electrophoresis.

Table 7

Intron-Exon Border Spanning Primer and Probe Sequences
for Breast Tumor Multiplex Assay

Gene         Oligonucleotide   Sequence
                               (Taqman probes: FAM-5' - 3'-TAMRA)

Mammaglobin  Forward Primer    tgccatagatgaattgaaggaatg (SEQ ID NO:48)
             Reverse Primer    tgtcatatattaattgcataaacacctca (SEQ ID NO:49)
             Taqman probe      tcttaaccaaacggatgaaactctgagcaatg (SEQ ID NO:50)

B899P        Forward Primer    aagcctcagagtccttccagtatg (SEQ ID NO:36)
             Reverse Primer    ttcaaatataagtgaagaaaaaattagtagatcaa (SEQ ID NO:37)
             Taqman probe      aatccattgtatcttagaaccgagggatttgttt (SEQ ID NO:62)

B305D        Forward Primer    aaagcagatggtggttgaggtt (SEQ ID NO:39)
             Reverse Primer    cctgagaccaaatggcttcttc (SEQ ID NO:40)
             Taqman probe      attccatgccggctgcttcttctg (SEQ ID NO:41)

B726P        Forward Primer    tctggttttctcattctttattcatttatt (SEQ ID NO:42)
             Reverse Primer    tgccaaggagcggattatct (SEQ ID NO:43)
             Taqman probe      caaccacgtgacaaacactggaattacagg (SEQ ID NO:44)

Actin        Forward Primer    actggaacggtgaaggtgaca (SEQ ID NO:45)
             Reverse Primer    cggccacattgtgaactttg (SEQ ID NO:46)
             Taqman probe      cagtcggttggagcgagcatccc (SEQ ID NO:47)

B899P-INT    Forward Primer    caattttggtggagaacccg (SEQ ID NO:53)
             Reverse Primer    gctgtcggaggtatatggtg (SEQ ID NO:54)
             Taqman probe      catttcagagagtaacatggactacaca (SEQ ID NO:55)

B305D-INT    Forward Primer    tctgataaaggccgtacaatg (SEQ ID NO:56)
             Reverse Primer    tcacgacttgctgtttttgctc (SEQ ID NO:57)
             Taqman probe      atcaaaaaacaagcatggcctcacaccact (SEQ ID NO:58)

B726P-INT    Forward Primer    gcaagtgccaatgatcagagg (SEQ ID NO:59)
             Reverse Primer    atatagactcaggtatacacact (SEQ ID NO:60)
             Taqman probe      tcccatcagaatccaaacaagaggaagatg (SEQ ID NO:61)



Primer locations and assay conditions were optimized to achieve parallel 
amplification of the four PCR products. The assay conditions were as follows: 

Taqman protocol (7700 Perkin Elmer)

In 25 µl final volume: 1x Buffer A, 5 mM MgCl2, 0.2 mM dCTP, 0.2 mM
dATP, 0.4 mM dUTP, 0.2 mM dGTP, 0.01 U/µl AmpErase UNG, 8% (v/v) Glycerol,
0.05% (v/v) Gelatin, 0.01% (v/v) Tween20, 4 pmol of each gene specific Taqman
probe (Mammaglobin + B899P-INT + B305D-INT + B726P-INT), 300 nM of B305D-INT-F;
B899P-INT-F, 100 nM of Mammaglobin-F + R; B726P-INT-F + R, 50 nM of
B899P-INT-R; B305D-INT-R, template cDNA (originating from 0.02 µg polyA+
RNA).

PCR cycling conditions

1 cycle at 50°C for 2 minutes, 1 cycle at 95°C for 10 minutes, 50 cycles
of 95°C for 1 minute and 68°C for 1 minute.

Figure 10 shows a comparison of the multiplex assay using intron-exon
border spanning primers (bottom panel) and the multiplex assay using non-optimized
primers (top panel), to detect breast cancer cells in a panel of lymph node tissues. This
experiment shows that a reduction in background resulting from genomic DNA
contamination in samples is achieved using the intron-exon spanning primers of the
present invention.

EXAMPLE 12 

Multiplex Detection of Metastasized Breast Tumor Cells in 
Sentinel Lymph Node Biopsy Samples 
Lymph node staging is important for determining appropriate adjuvant
hormone and chemotherapy. In contrast to conventional axillary dissection, a less
invasive approach for staging of minimal residual disease is sentinel lymph node
biopsy. Sentinel lymph node biopsy (SLNB) has the potential to improve detection of
metastases and to provide prognostic value while keeping the morbidity
associated with complete lymph node dissection to a minimum. SLNB implements mapping of the
one or two lymph nodes which primarily drain the tumor and therefore are most likely
to harbor metastatic disease (the sentinel nodes). Routine pathological analysis of
lymph nodes results in a high false-negative rate: one-third of women with
pathologically negative lymph nodes develop recurrent disease [Bland: The Breast:
Saunders 1991]. A more sensitive detection technique for tumor cells would be
RT-PCR, but its application is limited by the lack of a single specific marker. The multimarker
assay described above increases the likelihood of cancer detection across the population
without producing false-positive results from normal lymph nodes.

As mentioned above, lymphatic afferents from a primary tumor drain
into a single node, the sentinel lymph node, before drainage into the regional lymphatic
basin occurs. Sentinel lymph nodes are located with dyes and/or radiolabelled colloid
injected in the primary lesion site. Sentinel lymph node biopsy allows pathological
examination for micrometastatic deposits and staging of the axilla, and therefore can avoid
unnecessary axillary dissection. Nodal micrometastases can be located with staining
(haematoxylin and eosin) or immunohistochemical analysis for cytokeratin proteins.
Immunocytochemical staining techniques can produce frequent false-negative results by
missing small metastatic foci due to inadequate sectioning of the node.
Immunohistochemistry can give false-positive results due to illegitimate expression
of cytokeratins (reticulum cells), or false-negative results when using the antibody
Ber-Ep4, whose corresponding antigen is not expressed on all tumor cells.

The multiplex assay described herein could provide a more sensitive
detection tool for positive sentinel lymph nodes. Moreover, the detection of breast
cancer cells in bone marrow samples, peripheral blood and small needle aspiration
samples is desirable for diagnosis and prognosis in breast cancer patients.

Twenty-two metastatic lymph node samples, in addition to 15 samples
previously analyzed and shown in Figure 3A, were analyzed using the intron-exon
border spanning multiplex PCR assay described herein. The results from this analysis
are summarized in Table 8. Twenty-seven primary tumors were also analyzed and the
results are shown in Table 9. Twenty normal lymph node samples tested using this assay
were all negative.



Table 8

Multiplex Real-time PCR Analysis of 37 Metastatic Lymph Nodes

Breast metastatic
lymph node samples   Mammaglobin   B305D   B899P   B726P   Multiplex

B.Met 317A           ++                                    +++
B.Met 318A                                 ++              +++
B.Met 595A           +                                     +++
B.Met 611A           +             +       +++             ++
B.Met 612A           ++            ++              +       ++
B.Met 614A                         ++              ++      +++
B.Met 616A                                 +               ++
B.Met 618A           +++           +                       +++
B.Met 620A           ++            ++              ++      +++
B.Met 621A           +             +++             +       +++
B.Met 624A                                 ++              +++
B.Met 625A                         ++              ++      +
B.Met 627A                         +               +       +
B.Met 629A           ++                                    +++
B.Met 631A           +                     ++              +
1255                 +++           ++              ++      ++
1257                 +++           +       +       +       ++
769                  +++                           +       ++
1258                 ++            +       +               +
1259                               ++      ++              +++
1250                 +++           +               +       +++
1726                 +++           +               +
786                  +++           +       +               +++
281-LI-r             +++                                   +++
289-L2               ++            +                       ++
366-S                +                                     +
374-S+               +++           ++
376-S                ++                            +       ++
381-S                +             +                       +
383-Sx               +++           ++                      +++
496-M                +++           ++                      +++
591-SI-A             +             +                       +
652-1                              +       ++              +++
772                                                        +
777                  +             +               ++      ++
778                  +++                                   +++
779                                        ++              ++

Table 9

Tumor samples        Mammaglobin   B305D   B899P   B726P   Multiplex

T443                 +             ++              +++     +++
T457                               +       +               ++
T395                                                       ++
T10A                 +             +++             +++     +++
T446                               +               ++      ++
T11C                 +                     +++             +++
T23B                 +             ++                      +++
T207A                              ++

From the foregoing it will be appreciated that, although specific
embodiments of the invention have been described herein for purposes of illustration,
various modifications may be made without deviating from the spirit and scope of the
invention. Accordingly, the invention is not limited except as by the appended claims.

CLAIMS

We Claim: 

1. A method for identifying one or more tissue- 
specific polynucleotides, said method comprising the steps of: 

(a) performing a genetic subtraction to identify a pool of 
polynucleotides from a tissue of interest; 

(b) performing a DNA microarray analysis to identify a first subset 
of said pool of polynucleotides of interest wherein each member polynucleotide of said 
first subset is at least two-fold over-expressed in said tissue of interest as compared to a 
control tissue; and 

(c) performing a quantitative polymerase chain reaction (PCR) 
analysis on polynucleotides within said first subset to identify a second subset of 
polynucleotides that are at least two-fold over-expressed as compared to said control 
tissue; 

wherein a polynucleotide is identified as tissue-specific if it is at least two-fold over- 
expressed by both microarray and quantitative PCR analyses. 

2. The method of claim 1 wherein said genetic subtraction is 
selected from the group consisting of differential display and cDNA subtraction. 

3. A method for identifying a subset of polynucleotides showing 
complementary tissue-specific expression profiles in a tissue of interest, said method 
comprising the steps of: 

(a) performing a first expression analysis selected from the group 
consisting of DNA microarray and quantitative PCR to identify a first polynucleotide 
that is at least two-fold over-expressed in a first tissue sample of interest obtained from 
a first patient but not over-expressed in a second tissue sample of interest as compared 
to a control tissue; and 

(b) performing a second expression analysis selected from the group
consisting of DNA microarray and quantitative PCR to identify a second polynucleotide
that is at least two-fold over-expressed in a second tissue sample of interest obtained 
from a second patient but not over-expressed in a first tissue sample of interest as 
compared to said control tissue; 

wherein the first tissue sample and said second tissue sample are of the same tissue 
type, and wherein over-expression of said first polynucleotide in only said first tissue 
samples of interest and over-expression of said second polynucleotide in only said 
second tissue sample of interest indicates complementary tissue-specific expression of 
said first polynucleotide and said second polynucleotide. 

4. A method for determining the presence of a cancer cell in a 
patient, said method comprising the steps of: 

(a) obtaining a biological sample from said patient; 

(b) contacting said biological sample with a first oligonucleotide that 
hybridizes to a first polynucleotide, said first polynucleotide selected from the group
consisting of polynucleotides depicted in SEQ ID NO:73, SEQ ID NO:74 and SEQ ID
NO:76; 

(c) contacting said biological sample with a second oligonucleotide 
that hybridizes to a second polynucleotide selected from the group consisting of SEQ
ID NO: 1, 3, 5-7, 11, 13, 15, 17, 19-24, 30, 32, and 75; 

(d) detecting in said sample an amount of a polynucleotide that
hybridizes to at least one of said oligonucleotides; and 

(e) comparing the amount of the polynucleotide that hybridizes to 
said oligonucleotide to a predetermined cut-off value, and therefrom determining the 
presence or absence of a cancer in the patient. 

5. A method for determining the presence or absence of a cancer in 
a patient, said method comprising the steps of: 

(a) obtaining a biological sample from said patient;

(b) contacting said biological sample with a first oligonucleotide that
hybridizes to a first polynucleotide selected from the group consisting of 
polynucleotides depicted in SEQ ID NO:73, SEQ ID NO:74 and SEQ ID NO:76; 

(c) contacting said biological sample with a second oligonucleotide 
that hybridizes to a second polynucleotide as depicted in SEQ ID NO:75; 

(d) contacting said biological sample with a third oligonucleotide 
that hybridizes to a third polynucleotide selected from the group consisting of 
polynucleotides depicted in SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7;

(e) contacting said biological sample with a fourth oligonucleotide 
that hybridizes to a fourth polynucleotide selected from the group consisting of 
polynucleotides depicted in SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID
NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23 and SEQ ID
NO:24; 

(f) detecting in said biological sample an amount of a polynucleotide 
that hybridizes to at least one of said oligonucleotides; and 

(g) comparing the amount of polynucleotide that hybridizes to the 
oligonucleotide to a predetermined cut-off value, and therefrom determining the 
presence or absence of a cancer in the patient. 

6. A method for determining the presence or absence of a cancer in
a patient, said method comprising the steps of: 

(a) obtaining a biological sample from said patient; 

(b) contacting said biological sample with an oligonucleotide that 
hybridizes to a tissue-specific polynucleotide; 

(c) detecting in the sample a level of a polynucleotide that hybridizes 
to the oligonucleotide; and 

(d) comparing the level of polynucleotide that hybridizes to the 
oligonucleotide with a predetermined cut-off value, and therefrom determining the 
presence or absence of a cancer in the patient. 

7. A method for monitoring the progression of a cancer in a patient,
said method comprising the steps of: 

(a) obtaining a first biological sample from said patient; 

(b) contacting said biological sample with an oligonucleotide that 
hybridizes to a polynucleotide that encodes a breast tumor protein; 

(c) detecting in the sample an amount of said polynucleotide that 
hybridizes to said oligonucleotide; 

(d) repeating steps (b) and (c) using a second biological sample 
obtained from said patient at a subsequent point in time; and 

(e) comparing the amount of polynucleotide detected in step (d) with 
the amount detected in step (c) and therefrom monitoring the progression of the cancer
in the patient. 

8. The method of any one of claim 6 and claim 7 wherein said
polynucleotide encodes a breast tumor protein selected from the group consisting of
mammaglobin, lipophilin B, GABAπ (B899P), B726P, B511S, B533S, B305D and
B311D.

9. A method for detecting the presence of a cancer cell in a patient, 
said method comprising the steps of:

(a) obtaining a biological sample from said patient; 

(b) contacting said biological sample with a first oligonucleotide that 
hybridizes to a first polynucleotide selected from the group consisting of mammaglobin 
and lipophilin B; 

(c) contacting said biological sample with a second oligonucleotide 
that hybridizes to a second polynucleotide sequence selected from the group consisting 
of GABAπ (B899P), B726P, B511S, B533S, B305D and B311D;

(d) detecting in said biological sample an amount of a polynucleotide 
that hybridizes to at least one of the oligonucleotides; and 

(e) comparing the amount of polynucleotide that hybridizes to the
oligonucleotide to a predetermined cut-off value, and therefrom determining the 
presence or absence of a cancer in the patient. 

10. A method for determining the presence of a cancer cell in a 
patient, said method comprising the steps of: 

(a) obtaining a biological sample from said patient; 

(b) contacting said biological sample with a first oligonucleotide that 
hybridizes to a first polynucleotide selected from the group consisting of a
polynucleotide depicted in SEQ ID NO:73 and SEQ ID NO:74 or complement thereof; 

(c) contacting said biological sample with a second oligonucleotide 
that hybridizes to a second polynucleotide depicted in SEQ ID NO:75 or complement 
thereof; 

(d) contacting said biological sample with a third oligonucleotide 
that hybridizes to a third polynucleotide selected from the group consisting of a
polynucleotide depicted in SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:6
and SEQ ID NO:7 or complement thereof;

(e) contacting said biological sample with a fourth oligonucleotide 
that hybridizes to a fourth polynucleotide selected from the group consisting of a 
polynucleotide depicted in SEQ ID NO:11 or complement thereof;

(f) contacting said biological sample with a fifth oligonucleotide that
hybridizes to a fifth polynucleotide selected from the group consisting of a
polynucleotide depicted in SEQ ID NO: 13, 15 and 17 or complement thereof; 

(g) contacting said biological sample with a sixth oligonucleotide 
that hybridizes to a sixth polynucleotide selected from the group consisting of a
polynucleotide depicted in SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID 
NO:22, SEQ ID NO:23 and SEQ ID NO:24 or complement thereof; 

(h) contacting said biological sample with a seventh oligonucleotide 
that hybridizes to a seventh polynucleotide depicted in SEQ ID NO:30 or complement 
thereof; 

(i) contacting said biological sample with an eighth oligonucleotide
that hybridizes to an eighth polynucleotide depicted in SEQ ID NO:32 or complement
thereof; 

(j) contacting said biological sample with a ninth oligonucleotide 
that hybridizes to a polynucleotide depicted in SEQ ID NO:76 or complement thereof; 

(k) detecting in said biological sample a hybridized oligonucleotide 
of any one of steps (b) through (j) and comparing the amount of polynucleotide that 
hybridizes to the oligonucleotide to a predetermined cut-off value, 
wherein the presence of a hybridized oligonucleotide in any one of steps (b) through (j) 
in excess of the pre-determined cut-off value indicates the presence of a cancer cell in 
the biological sample of said patient. 

11. A method for determining the presence of a cancer cell in a 
patient, said method comprising the steps of: 

(a) obtaining a biological sample from said patient; 

(b) contacting said biological sample with a first oligonucleotide and
a second oligonucleotide; 

i. wherein said first oligonucleotide and said second 
oligonucleotide hybridize to a first polynucleotide and a second polynucleotide, 
respectively; 

ii. wherein said first polynucleotide and said second 
polynucleotide are selected from the group consisting of polynucleotides depicted in
SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:1, SEQ ID NO:3, SEQ
ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:11, SEQ ID NO:13, SEQ ID
NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID
NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:30, SEQ ID NO:32, and SEQ ID 
NO:76; and 

iii. wherein said first polynucleotide is unrelated in
nucleotide sequence to said second polynucleotide;

(c) detecting in said biological sample said hybridized first
oligonucleotide and said hybridized second hybridized oligonucleotide; and 

(d) comparing the amount of said hybridized first oligonucleotide 
and said hybridized second hybridized oligonucleotide to a predetermined cut-off value; 
wherein an amount of said hybridized first oligonucleotide or said hybridized second 
oligonucleotide in excess of the pre-determined cut-off value indicates the presence of a
cancer cell in the biological sample of said patient. 

12. A method for determining the presence or absence of a cancer 
cell in a patient, said method comprising the steps of: 

(a) obtaining a biological sample from said patient; 

(b) contacting said biological sample with a first oligonucleotide and
a second oligonucleotide;

i. wherein said first oligonucleotide and said second 
oligonucleotide hybridize to a first polynucleotide and a second polynucleotide,
respectively; 

ii. wherein said first polynucleotide and said second
polynucleotide are both tissue-specific polynucleotides of the cancer cell to be detected; 
and 

iii. wherein said first polynucleotide is unrelated in
nucleotide sequence to said second polynucleotide;

(c) detecting in said biological sample said first hybridized 
oligonucleotide and said second hybridized oligonucleotide; and 

(d) comparing the amount of polynucleotide that hybridizes to the 
oligonucleotide to a predetermined cut-off value, wherein the presence of a hybridized 
first oligonucleotide or a hybridized second oligonucleotide in excess of the
pre-determined cut-off value indicates the presence of a cancer cell in the biological sample
of said patient. 

13. A method for detecting the presence of a cancer cell in a patient,
said method comprising the steps of: 

(a) obtaining a biological sample from said patient; 

(b) contacting said biological sample with a first oligonucleotide pair 
said first pair comprising a first oligonucleotide and a second oligonucleotide wherein 
said first oligonucleotide and said second oligonucleotide hybridize to a first 
polynucleotide and the complement thereof, respectively; 

(c) contacting said biological sample with a second oligonucleotide 
pair said second pair comprising a third oligonucleotide and a fourth oligonucleotide 
wherein said third and said fourth oligonucleotide hybridize to a second polynucleotide 
and the complement thereof, respectively, and wherein said first polynucleotide is 
unrelated in nucleotide sequence to said second polynucleotide; 

(d) amplifying said first polynucleotide and said second
polynucleotide; and

(e) detecting said amplified first polynucleotide and said amplified 
second polynucleotide; 

wherein the presence of said amplified first polynucleotide or said amplified second 
polynucleotide indicates the presence of a cancer cell in said patient. 

14. The method of any one of claims 4-7 and 9-13 wherein said 
biological sample is selected from the group consisting of blood, serum, lymph node, 
bone marrow, sputum, urine and tumor biopsy sample. 

15. The method of claim 14 wherein said biological sample is 
selected from the group consisting of blood, a lymph node and bone marrow. 

16. The method of claim 15 wherein said lymph node is a sentinel
lymph node.

17. The method of any one of claims 4-7 and 9-13 wherein said
cancer is selected from the group consisting of prostate cancer, breast cancer, colon 
cancer, ovarian cancer, lung cancer, head & neck cancer, lymphoma, leukemia,
melanoma, liver cancer, gastric cancer, kidney cancer, bladder cancer, pancreatic cancer 
and endometrial cancer. 

18. The method of any one of claims 12 and 13 wherein said first
polynucleotide and said second polynucleotide are selected from the group consisting of 
mammaglobin, lipophilin B, GABAπ (B899P), B726P, B511S, B533S, B305D and
B311D. 

19. The method of any one of claims 12 and 13 wherein said first 
polynucleotide and said second polynucleotide are selected from the group consisting of 
polynucleotides depicted in SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID
NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:11,
SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:20, SEQ
ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:30, SEQ ID
NO:32, and SEQ ID NO:76.

20. The method of any one of claims 12 and 13 wherein said 
oligonucleotides are selected from the group consisting of oligonucleotides depicted in 
SEQ ID NOs:33-35 and 63-72.

21. The method of any one of claims 12 and 13 wherein the step of 
detection of said first amplified polynucleotide and said second polynucleotide 
comprises a step selected from the group consisting of detecting a radiolabel and 
detecting a fluorophore. 

22. The method of any one of claims 4-7 and 9-13 wherein said step 
of detection comprises a step of fractionation. 

23. The method of any one of claims 12 and 13 wherein said first and
said oligonucleotides are intron spanning oligonucleotides. 

24. The method of claim 23 wherein said intron spanning 
oligonucleotides are selected from the group consisting of oligonucleotides depicted in 
SEQ ID NOs:36-62. 

25. The method of claim 13 wherein detection of said amplified first 
or said second polynucleotide comprises contacting said amplified first or said second 
polynucleotide with a labeled oligonucleotide probe that hybridizes, under moderately 
stringent conditions, to said first or said second polynucleotide. 

26. The method of claim 13 wherein said labeled oligonucleotide 
probe comprises a detectable moiety selected from the group consisting of a radiolabel
and a fluorophore. 

27. The method of any one of claims 4-7 and 9-13 further comprising 
a step of enriching said cancer cell from said biological sample prior to hybridizing said 
oligonucleotide primer(s). 

28. The method of claim 27 wherein said step of enriching said 
cancer cell from said biological sample is achieved by a methodology selected from the 
group consisting of cell capture and cell depletion. 

29. The method of claim 28 wherein cell capture is achieved by 
immunocapture, said immunocapture comprising the steps of: 

(a) adsorbing an antibody to the surface of said cancer cells; and 

(b) separating said antibody adsorbed cancer cells from the
remainder of said biological sample.

30. The method of claim 29 wherein said antibody is directed to an
antigen selected from the group consisting of CD2, CD3, CD4, CD5, CD8, CD10,
CD11b, CD14, CD15, CD16, CD19, CD20, CD24, CD25, CD29, CD33, CD34, CD36,
CD38, CD41, CD45, CD45RA, CD45RO, CD56, CD66B, CD66e, HLA-DR, IgE and
TCRαβ.

31. The method of claim 29 wherein said antibody is directed to a 
breast tumor antigen. 

32. The method of any one of claims 29-31 wherein said antibody is 
a monoclonal antibody. 

33. The method of claim 29 wherein said antibody is conjugated to 
magnetic beads. 

34. The method of claim 29 wherein said antibody is formulated in a 
tetrameric antibody complex. 

35. The method of claim 28 wherein cell depletion is achieved by a 
method comprising the steps of: 

(a) cross-linking red cells and white cells, and 

(b) fractionating said cross-linked red and white cells from the 
remainder of said biological sample. 

36. The method of claim 13 wherein said step of amplifying is 
achieved by a polynucleotide amplification methodology selected from the group 
consisting of reverse transcription polymerase chain reaction (RT-PCR), inverse PCR, 
RACE, ligase chain reaction (LCR), Qbeta Replicase, isothermal amplification, strand 
displacement amplification (SDA), rolling chain reaction (RCR), cyclic probe reaction 
(CPR), transcription-based amplification systems (TAS), nucleic acid sequence based 
amplification (NASBA) and 3SR. 

37. A composition for detecting a cancer cell in a biological sample
of a patient, said composition comprising: 

(a) a first oligonucleotide; and 

(b) a second oligonucleotide; 

wherein said first oligonucleotide and said second oligonucleotide hybridize to a first 
polynucleotide and to a second polynucleotide, respectively; wherein said first 
polynucleotide is unrelated in nucleotide sequence from said second polynucleotide;
and wherein said first polynucleotide and said second polynucleotide are tissue-specific 
polynucleotides of the cancer cell to be detected. 

38. The composition of claim 37 wherein said first polynucleotide 
and said second polynucleotide are complementary tissue-specific polynucleotides of 
the tissue-type of said cancer cell. 

39. The composition of any one of claim 37 and claim 38 wherein 
said first polynucleotide and said second polynucleotide are selected from the group 
consisting of the polynucleotides depicted in SEQ ID NO:73, SEQ ID NO:74, SEQ ID 
NO:75, SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7,
SEQ ID NO:11, SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ
ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID
NO:30, SEQ ID NO:32, and SEQ ID NO:76. 

40. The composition of any one of claim 37 and claim 38 wherein
said oligonucleotides are selected from the group consisting of oligonucleotides as
disclosed in SEQ ID NOs:33-72.

41. A composition for detecting a cancer cell in a biological sample 
of a patient, said composition comprising: 

(a) a first oligonucleotide pair; and 

(b) a second oligonucleotide pair; 
wherein said first oligonucleotide pair and said second oligonucleotide pair hybridize to
a first polynucleotide (or complement thereof) and to a second polynucleotide (or 
complement thereof), respectively; wherein said first polynucleotide is unrelated in 
nucleotide sequence from said second polynucleotide; and wherein said first 
polynucleotide and said second polynucleotide are tissue-specific polynucleotides of the 
cancer cell to be detected. 

42. The composition of claim 41 wherein said first polynucleotide 
and said second polynucleotide are complementary tissue-specific polynucleotides of 
the tissue-type of said cancer cell. 

43. The composition of any one of claim 41 and claim 42 wherein 
said first polynucleotide and said second polynucleotide are selected from the group
consisting of the polynucleotides depicted in SEQ ID NO:73, SEQ ID NO:74, SEQ ID
NO:75, SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7,
SEQ ID NO:11, SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ
ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID
NO:30, SEQ ID NO:32, and SEQ ID NO:76. 

44. The composition of any one of claim 41 and claim 42 wherein 
said oligonucleotides are selected from the group consisting of oligonucleotides as 
disclosed in SEQ ID NOs: 33-72. 

45. A composition comprising an oligonucleotide primer or probe of 
between 15 and 100 nucleotides that comprises an oligonucleotide selected from the 
group consisting of oligonucleotides depicted in SEQ ID NOs:33-72.

46. The composition of claim 45 comprising an oligonucleotide 
primer or probe selected from the group consisting of oligonucleotides depicted in SEQ 
ID NOs:33-72.

GACAGCGGCnCCTTGATimGCttCraGACTGAAlXCGACAGCAG 50

CAGCCTCA(IATGAAGTTG(TGATGGmCAT6CTGGCQGCCCTCn:CC 100 

AGWCTGCTACGimTCTGGCTGCCCCTTATTGGAGAATGTGATnX 150 

AAGACAATWTCCAtMTGTCTAAGACTGAATACAAAGAAnrc™ 200 

AGAGnCATAGACGACAATGCCACTACAAATGCCATAGATGAATTGAAQG 250 

AATGTTHCnAAlXAAACGGATGAAACTCTGAGCAATGTTGAGGTGTTT 300 

CTGCAAnAATATATGACAGCAGTCTTTCrGATTTATTITAACTTTCTGC 350 

AAGACCTnGGCTCACAGAACTGCAGGGTATGGTGAfiAAACCAACTACGG 400 

AHGCTGIMCCACACIJrCTCnrnnTATGTCTTmACTACAAACTA 450 

(mCAATTGTimtrmATACATGTTTATTnAATAAAlTGATG 500 
GCA 503 



Fig. 4 



CACTGCTACGCAGGaCTGGCreCCCCTTATTGGAGAATGTGATrrCCAA 50 

.GACAAT(MTCCACAAGrG[rcrAAGA(IGAATA(MGAACnaT(7\AG 100 

AGHCATAGACGACAATGIXACTACAAATGCCATAGATGAATTGAAGGAA 150 

TGTmOTAACCAMCGGATGAAACTCTGAGCAATGTTGAGGTGTTTAT 200 

GCAAnAATATATGACAGCAGrCTTTGrGAnTATTTGGCGGCCATCACC 250 

ATCACCATWCTAAGGTCIXGAGCTCmTTCTGCAGATATCCATCACAC 300 
T 301 



Fig. 5 

GGGACAGGGCmGATGAGGAGMCCOGimCCCAGAAGACCinBCCra 100
TCMCMCTACCTQG[MmTACn(mcCCnGGmGAG(mmCMW 
CACTCAGmTGTGCATCCmGAGTCAGnCMCGTmTCeGCAGAAGTGAC^ 
AAimmCTCAQGCCCAAMGGTGGAGAACCCGTACAGATAGCGCTGACTCTG^^ 

ACACAG(XA(XATATA(nCCGA(M:GCTGGATGGAC(MGGCTGGTGTTm 500 

CTGGGTGCCAGATACnA(:ATTGTGGAGrC(MAAGT(mCCT(XATGAAGTCACT^^ 

CTGTATGCCCTCAGAATCACGACAACTGTTGCATGTAACATGGAOTaAM^^ 

GCTATGATGGAAATGATCTGGAGnWCTGGCTGAGAGGGAACGACTCTGTGCGTGGACTGGAO 

THCACCTTAGTCMAGATimGCAGGAGACAGGAAAmACTAGAnGGTCTTAC^^ 

AC(n"AGGrTC(TTraCT7T(nGGrGGrGTTGT(XTGGGmCATmGGATa"C[CT(^ 1000 

TGHATCAATGACimTGATCATCGGGTCCCGCACmmCtmMAAl^ 

C™TGTTim(mrAGAATAT6CAGn'GCTCACTACAGmcnA(V\GCAGATQ^ 

GTCAGTATTACTAATATCATCAAlMlTCCATCTCCAGCTTTAAACmAGATC^ 

ACHGACAATGAAAACCAGCGACAAGTTlMGmGTCTTCCGAGAAAAGATGGGC^ 

TCACTAnC(MCTACTGTnC(nTTGATTTTTATGCTAGCCAATGTATTnACTC^ 1500 

TGCCATAGGTCnCAACAGGACAAGATAATGATGTAAATGGTATmAGGCCA^^ 

TGAGTCmCTGCTCMGAATGAAGCTCCAACCATTGnCTAAGGBTGTAGAAG^^^ 

TCTnCATCTTAATCAAGGACAnCCCATGGAGCCCmTTAtMTGTAaCA™ 

AAGAAlBGGAAGGAGAaiAlTGGGTAAraTCAAGTGTCAGAAGTTmTC^ 

TAWnGTTGCCTAmAGGGAGTAACATTTTCTAGTTrnGmCTGGnAAAATGAAATATQGGCm^ 2000 

TCAATAC(MGATGAGTTmAAATAATGAATA™mAATACCACAACAGM^ 

AGTGAAGAAAAAAnAGTAGATCAACMCTAAACAAATCCOtGGTTCTAAGATACAAT^ 

an^AmTATOTATCATmAnATOTACACACATCCATCCTAAACTATACTAAAm 

TAACmrTCTAGAAGTCTTAATATGGGCTGnGCCATGAAGGCTOGAATTGAGrCCAm 

AAAGTACTGGGmCTCAGAGAGrCGCTGTCAmGT(:AlTG(mTACTCTAACACT& 2500 

AAGAGGAGCAmTCCCTTmCTAATGATCAGGAATGATGCTTAnAGAAAACM 

GGCTTmCAGATCTOTCCTTC(mTGGTCTGCTCTGAGTQGCraTCCCGC^^ 

AAWlBAGACAmCAGAACCTATGnCAGGnGCGGGTGAGCTGCCCTCTC^^ 

AGTAC(^GTGATnFGAGTGTGC(»GGGTAAAGG(nTa:AGrrCAGCCTCAGTTAm 

TCTAATAAAlBCACGG(:mACCmCCTGT(y\GAAATAAAam^ 3000 
CTlTT(ICAnATACTTCTCACAAnCAGmCTATGAGmGATCACCTGATT^ 
AAAGAGATCAAATGTGGTTGTATGAGCCAATCATAmGTGATTTmAAAAAAAGmAAAAfla^ 
TTTATATAAAAACAATGATAAAGATGTGAACTGTGAAATAAATATACCATAlTAGCTACaiACC^^ 3282 



Fig. 6 

Multiplex PCR assay:
Gene determination by amplicon size

SEQUENCE LISTING



<110> Corixa Corporation 

Houghton, Raymond L. 
Dillon, Davin C. 
Molesh, David A. 
Xu, Jiang Chun 
Zehentner, Barbara 
Persing, David H.



<120> METHODS, COMPOSITIONS AND KITS FOR THE DETECTION 
AND MONITORING OF BREAST CANCER 



<130> 210121.513PC



<140> PCT 

<141> 2001-04-02 



<160> 77 



<170> FastSEQ for Windows Version 3.0 



<210> 1 
<211> 1851 
<212> DNA 
<213> Homo sapien 



<400> 1 



tcatcaccat tgccagcagc ggcaccgtta gtcaggtttt ctgggaatcc cacatgagta 60
cttccgtgtt cttcattctt cttcaatagc cataaatctt ctagctctgg ctggctgttt 120
tcacttcctt taagcctttg tgactcttcc tctgatgtca gctttaagtc ttgttctgga 180
ttgctgtttt cagaagagat ttttaacatc tgtttttctt tgtagtcaga aagtaactgg 240
caaattacat gatgatgact agaaacagca tactctctgg ccgtctttcc agatcttgag 300
aagatacatc aacattttgc tcaagtagag ggctgactat acttgctgat ccacaacata 360
cagcaagtat gagagcagtt cttccatatc tatccagcgc atttaaattc gcttttttct 420
tgattaaaaa tttcaccact tgctgttttt gctcatgtat accaagtagc agtggtgtga 480
ggccatgctt gttttttgat tcgatatcag caccgtataa gagcagtgct ttggccatta 540
atttatcttc attgtagaca gcatagtgta gagtggtatt tccatactca tctggaatat 600
ttggatcagt gccatgttcc agcaacatta acgcacattc atcttcctgg cattgtacgg 660
cctttgtcag agctgtcctc tttttgttgt caaggacatt aagttgacat cgtctgtcca 720
gcacgagttt tactacttct gaattcccat tggcagaggc cagatgtaga gcagtcctct 780
tttgcttgtc cctcttgttc acatccgtgt ccctgagcat gacgatgaga tcctttctgg 840
ggactttacc ccaccaggca gctctgtgga gcttgtccag atcttctcca tggacgtggt 900
acctgggatc catgaaggcg ctgtcatcgt agtctcccca agcgaccacg ttgctcttgc 960
cgctcccctg cagcagggga agcagtggca gcaccacttg cacctcttgc tcccaagcgt 1020
cttcacagag gagtcgttgt ggtctccaga agtgcccacg ttgctcttgc cgctccccct 1080
gtccatccag ggaggaagaa atgcaggaaa tgaaagatgc atgcacgatg gtatactcct 1140
cagccatcaa acttctggac agcaggtcac ttccagcaag gtggagaaag ctgtccaccc 1200
acagaggatg agatccagaa accacaatat ccattcacaa acaaacactt ttcagccaga 1260
cacaggtact gaaatcatgt catctgcggc aacatggtgg aacctaccca atcacacatc 1320
aagagatgaa gacactgcag tatatctgca caacgtaata ctcttcatcc ataacaaaat 1380
aatataattt tcctctggag ccatatggat gaactatgaa ggaagaactc cccgaagaag 1440
ccagtcgcag agaagccaca ctgaagctct gtcctcagcc atcagcgcca cggacaggar 1500
tgtgtttctt ccccagtgat gcagcctcaa gttatcccga agctgccgca gcacacggtg 1560
gctcctgaga aacaccccag ctcttccggt ctaacacagg caagtcaata aatgtgataa 1620



tcacataaac agaattaaaa gcaaagtcac ataagcatct caacagacac agaaaaggca 1680 

tttgacaaaa tccagcatcc ttgtatttat tgttgcagtt ctcagaggaa atgcttctaa 1740

cttttcccca tttagtatta tgttggctgt gggcttgtca taggtggttt ttattacttt 1800 

aaggtatgtc ccttctatgc ctgttttgct gagggtttta attctcgtgc c 1851 

<210> 2 

<211> 329 
<212> PRT 
<213> Homo sapien 



<400> 2 



Met Asp Ile Val Val Ser Gly Ser His Pro Leu Trp Val Asp Ser Phe
1 5 10 15

Leu His Leu Ala Gly Ser Asp Leu Leu Ser Arg Ser Leu Met Ala Glu
20 25 30

Glu Tyr Thr Ile Val His Ala Ser Phe Ile Ser Cys Ile Ser Ser Ser
35 40 45

Leu Asp Gly Gln Gly Glu Arg Gln Glu Gln Arg Gly His Phe Trp Arg
50 55 60

Pro Gln Arg Leu Leu Cys Glu Asp Ala Trp Glu Gln Glu Val Gln Val
65 70 75 80

Val Leu Pro Leu Leu Pro Leu Leu Gln Gly Ser Gly Lys Ser Asn Val
85 90 95

Val Ala Trp Gly Asp Tyr Asp Asp Ser Ala Phe Met Asp Pro Arg Tyr
100 105 110

His Val His Gly Glu Asp Leu Asp Lys Leu His Arg Ala Ala Trp Trp
115 120 125

Gly Lys Val Pro Arg Lys Asp Leu Ile Val Met Leu Arg Asp Thr Asp
130 135 140

Val Asn Lys Arg Asp Lys Gln Lys Arg Thr Ala Leu His Leu Ala Ser
145 150 155 160

Ala Asn Gly Asn Ser Glu Val Val Lys Leu Val Leu Asp Arg Arg Cys
165 170 175

Gln Leu Asn Val Leu Asp Asn Lys Lys Arg Thr Ala Leu Thr Lys Ala
180 185 190

Val Gln Cys Gln Glu Asp Glu Cys Ala Leu Met Leu Leu Glu His Gly
195 200 205

Thr Asp Pro Asn Ile Pro Asp Glu Tyr Gly Asn Thr Thr Leu His Tyr
210 215 220

Ala Val Tyr Asn Glu Asp Lys Leu Met Ala Lys Ala Leu Leu Leu Tyr
225 230 235 240

Gly Ala Asp Ile Glu Ser Lys Asn Lys His Gly Leu Thr Pro Leu Leu
245 250 255

Leu Gly Ile His Glu Gln Lys Gln Gln Val Val Lys Phe Leu Ile Lys
260 265 270

Lys Lys Ala Asn Leu Asn Ala Leu Asp Arg Tyr Gly Arg Thr Ala Leu
275 280 285

Ile Leu Ala Val Cys Cys Gly Ser Ala Ser Ile Val Ser Pro Leu Leu
290 295 300

Glu Gln Asn Val Asp Val Ser Ser Gln Asp Leu Glu Arg Arg Pro Glu
305 310 315 320

Ser Met Leu Phe Leu Val Ile Ile Met
325



<210> 3 
<211> 1852 
<212> DNA 

<213> Homo sapiens 

<400> 3

ggcacgagaa ttaaaaccct cagcaaaaca ggcatagaag ggacatacct taaagtaata 60 
aaaaccacct atgacaagcc cacagccaac ataatactaa atggggaaaa gttagaagca 120 
tttcctctga gaactgcaac aataaataca aggatgctgg attttgtcaa atgccttttc 180 
tgtgtctgtt gagatgctta tgtgactttg cttttaattc tgtttatgtg attatcacat 240 
ttattgactt gcctgtgtta gaccggaaga gctggggtgt ttctcaggag ccaccgtgtg 300 
ctgcggcagc ttcgggataa cttgaggctg catcactggg gaagaaacac aytcctgtcc 360 
gtggcgctga tggctgagga cagagcttca gtgtggcttc tctgcgactg gcttcttcgg 420 
ggagttcttc cttcatagtt catccatatg gctccagagg aaaattatat tattttgtta 480 
tggatgaaga gtattacgtt gtgcagatat actgcagtgt cttcatctct tgatgtgtga 540 
ttgggtaggt tccaccatgt tgccgcagat gacatgattt cagtacctgt gtctggctga 600 
aaagtgtttg tttgtgaatg gatattgtgg tttctggatc tcatcctctg tgggtggaca 660 
gctttctcca ccttgctgga agtgacctgc tgtccagaag tttgatggct gaggagtata 720 
ccatcgtgca tgcatctttc atttcctgca tttcttcctc cctggatgga cagggggagc 780 
ggcaagagca acgtgggcac ttctggagac cacaacgact cctctgtgaa gacgcttggg 840 
agcaagaggt gcaagtggtg ctgccactgc ttcccctgct gcagggggag cggcaagagc 900 
aacgtggtcg cttggggaga ctacgatgac agcgccttca tggatcccag gtaccacgtc 960 
catggagaag atctggacaa gctccacaga gctgcctggt ggggtaaagt ccccagaaag 1020 
gatctcatcg tcatgctcag ggacacggat gtgaacaaga gggacaagca aaagaggact 1080 
gctctacatc tggcctctgc caatgggaat tcagaagtag taaaactcgt gctggacaga 1140 
cgatgtcaac ttaatgtcct tgacaacaaa aagaggacag ctctgacaaa ggccgtacaa 1200 
tgccaggaag atgaatgtgc gttaatgttg ctggaacatg gcactgatcc aaatattcca 1260 
gatgagtatg gaaataccac tctacactat gctgtctaca atgaagataa attaatggcc 1320 
aaagcactgc tcttatacgg tgctgatatc gaatcaaaaa acaagcatgg cctcacacca 1380 
ctgctacttg gtatacatga gcaaaaacag caagtggtga aatttttaat caagaaaaaa 1440 
gcgaatttaa atgcgctgga tagatatgga agaactgctc tcatacttgc tgtatgttgt 1500 
ggatcagcaa gtatagtcag ccctctactt gagcaaaatg ttgatgtatc ttctcaagat 1560 
ctggaaagac ggccagagag tatgctgttt ctagtcatca tcatgtaatt tgccagttac 1620
tttctgacta caaagaaaaa cagatgttaa aaatctcttc tgaaaacagc aatccagaac 1680 
aagacttaaa gctgacatca gaggaagagt cacaaaggct taaaggaagt gaaaacagcc 1740 
agccagagct agaagattta tggctattga agaagaatga agaacacgga agtactcatg 1800 
tgggattccc agaaaacctg actaacggtg ccgctgctgg caatggtgat ga 1852 

<210> 4 
<211> 292 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met His Leu Ser Phe Pro Ala Phe Leu Pro Pro Trp Met Asp Arg Gly 
5 10 15 

Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp His Asn Asp Ser Ser 
20 25 30 

Val Lys Thr Leu Gly Ser Lys Arg Cys Lys Trp Cys Cys His Cys Phe 
35 40 45 

Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val Val Ala Trp Gly Asp 
50 55 60 

Tyr Asp Asp Ser Ala Phe Met Asp Pro Arg Tyr His Val His Gly Glu 
65 70 75 80 



Asp Leu Asp Lys Leu His Arg Ala Ala Trp Trp Gly Lys Val Pro Arg 
85 90 95 

Lys Asp Leu Ile Val Met Leu Arg Asp Thr Asp Val Asn Lys Arg Asp
 100 105 110

Lys Gln Lys Arg Thr Ala Leu His Leu Ala Ser Ala Asn Gly Asn Ser
 115 120 125

Glu Val Val Lys Leu Val Leu Asp Arg Arg Cys Gln Leu Asn Val Leu
 130 135 140

Asp Asn Lys Lys Arg Thr Ala Leu Thr Lys Ala Val Gln Cys Gln Glu
145 150 155 160

Asp Glu Cys Ala Leu Met Leu Leu Glu His Gly Thr Asp Pro Asn Ile
 165 170 175

Pro Asp Glu Tyr Gly Asn Thr Thr Leu His Tyr Ala Val Tyr Asn Glu
 180 185 190

Asp Lys Leu Met Ala Lys Ala Leu Leu Leu Tyr Gly Ala Asp Ile Glu
 195 200 205

Ser Lys Asn Lys His Gly Leu Thr Pro Leu Leu Leu Gly Ile His Glu
 210 215 220

Gln Lys Gln Gln Val Val Lys Phe Leu Ile Lys Lys Lys Ala Asn Leu
225 230 235 240

Asn Ala Leu Asp Arg Tyr Gly Arg Thr Ala Leu Ile Leu Ala Val Cys
 245 250 255

Cys Gly Ser Ala Ser Ile Val Ser Pro Leu Leu Glu Gln Asn Val Asp
 260 265 270

Val Ser Ser Gln Asp Leu Glu Arg Arg Pro Glu Ser Met Leu Phe Leu
 275 280 285

Val Ile Ile Met
 290

<210> 5 

<211> 1155 
<212> DNA
<213> Homo sapien 

<400> 5 

atggtggttg aggttgattc catgccggct gcctcttctg tgaagaagcc atttggtctc 60 

aggagcaaga tgggcaagtg gtgctgccgt tgcttcccct gctgcaggga gagcggcaag 120 

agcaacgtgg gcacttctgg agaccacgac gactctgcta tgaagacact caggagcaag 180 

atgggcaagt ggtgccgcca ctgcttcccc tgctgcaggg ggagtggcaa gagcaacgtg 240 

ggcgcttctg gagaccacga cgactctgct atgaagacac tcaggaacaa gatgggcaag 300

tggtgctgcc actgcttccc ctgctgcagg gggagcggca agagcaaggt gggcgcttgg 360 

ggagactacg atgacagtgc cttcatggag cccaggtacc acgtccgtgg agaagatctg 420 

gacaagctcc acagagctgc ctggtggggt aaagtcccca gaaaggatct catcgtcatg 480 

ctcagggaca ctgacgtgaa caagaaggac aagcaaaaga ggactgctct acatctggcc 540 

tctgccaatg ggaattcaga agtagtaaaa ctcctgctgg acagacgatg tcaacttaat 600 

gtccttgaca acaaaaagag gacagctctg ataaaggccg tacaatgcca ggaagatgaa 660 

tgtgcgttaa tgttgctgga acatggcact gatccaaata ttccagatga gtatggaaat 720 

accactctgc actacgctat ctataatgaa gataaattaa tggccaaagc actgctctta 780 

tatggtgctg atatcgaatc aaaaaacaag catggcctca caccactgtt acttggtgta 840

catgagcaaa aacagcaagt cgtgaaattt ttaatcaaga aaaaagcgaa tttaaatgca 900

ctggatagat atggaaggac tgctctcata cttgctgtat gttgtggatc agcaagtata 960

gtcagccttc tacttgagca aaatattgat gtatcttctc aagatctatc tggacagacg 1020

gccagagagt atgctgtttc tagtcatcat catgtaattt gccagttact ttctgactac 1080

aaagaaaaac agatgctaaa aatctcttct gaaaacagca atccagaaaa tgtctcaaga 1140

accagaaata aataa 1155



<210> 6 
<211> 2000 
<212> DNA 
<213> Homo sapien 



<400> 6 



atggtggttg aggttgattc catgccggct gcctcttctg tgaagaagcc atttggtctc 60
aggagcaaga tgggcaagtg gtgctgccgt tgcttcccct gctgcaggga gagcggcaag 120
agcaacgtgg gcacttctgg agaccacgac gactctgcta tgaagacact caggagcaag 180
atgggcaagt ggtgccgcca ctgcttcccc tgctgcaggg ggagtggcaa gagcaacgtg 240
ggcgcttctg gagaccacga cgactctgct atgaagacac tcaggaacaa gatgggcaag 300
tggtgctgcc actgcttccc ctgctgcagg gggagcggca agagcaaggt gggcgcttgg 360
ggagactacg atgacagtgc cttcatggag cccaggtacc acgtccgtgg agaagatctg 420
gacaagctcc acagagctgc ctggtggggt aaagtcccca gaaaggatct catcgtcatg 480
ctcagggaca ctgacgtgaa caagaaggac aagcaaaaga ggactgctct acatctggcc 540
tctgccaatg ggaattcaga agtagtaaaa ctcctgctgg acagacgatg tcaacttaat 600
gtccttgaca acaaaaagag gacagctctg ataaaggccg tacaatgcca ggaagatgaa 660
tgtgcgttaa tgttgctgga acatggcact gatccaaata ttccagatga gtatggaaat 720
accactctgc actacgctat ctataatgaa gataaattaa tggccaaagc actgctctta 780
tatggtgctg atatcgaatc aaaaaacaag catggcctca caccactgtt acttggtgta 840
catgagcaaa aacagcaagt cgtgaaattt ttaatcaaga aaaaagcgaa tttaaatgca 900
ctggatagat atggaaggac tgctctcata cttgctgtat gttgtggatc agcaagtata 960
gtcagccttc tacttgagca aaatattgat gtatcttctc aagatctatc tggacagacg 1020
gccagagagt atgctgtttc tagtcatcat catgtaattt gccagttact ttctgactac 1080
aaagaaaaac agatgctaaa aatctcttct gaaaacagca atccagaaca agacttaaag 1140
ctgacatcag aggaagagtc acaaaggttc aaaggcagtg aaaatagcca gccagagaaa 1200
atgtctcaag aaccagaaat aaataaggat ggtgatagag aggttgaaga agaaatgaag 1260
aagcatgaaa gtaataatgt gggattacta gaaaacctga ctaatggtgt cactgctggc 1320
aatggtgata atggattaat tcctcaaagg aagagcagaa cacctgaaaa tcagcaattt 1380
cctgacaacg aaagtgaaga gtatcacaga atttgcgaat tagtttctga ctacaaagaa 1440
aaacagatgc caaaatactc ttctgaaaac agcaacccag aacaagactt aaagctgaca 1500
tcagaggaag agtcacaaag gcttgagggc agtgaaaatg gccagccaga gctagaaaat 1560
tttatggcta tcgaagaaat gaagaagcac ggaagtactc atgtcggatt cccagaaaac 1620
ctgactaatg gtgccactgc tggcaatggt gatgatggat taattcctcc aaggaagagc 1680
agaacacctg aaagccagca atttcctgac actgagaatg aagagtatca cagtgacgaa 1740
caaaatgata ctcagaagca attttgtgaa gaacagaaca ctggaatatt acacgatgag 1800
attctgattc atgaagaaaa gcagatagaa gtggttgaaa aaatgaattc tgagctttct 1860
cttagttgta agaaagaaaa agacatcttg catgaaaata gtacgttgcg ggaagaaatt 1920
gccatgctaa gactggagct agacacaatg aaacatcaga gccagctaaa aaaaaaaaaa 1980
aaaaaaaaaa aaaaaaaaaa 2000



<210> 7 
<211> 2040 
<212> DNA 
<213> Homo sapien 

<400> 7 

atggtggttg aggttgattc catgccggct gcctcttctg tgaagaagcc atttggtctc 60
aggagcaaga tgggcaagtg gtgctgccgt tgcttcccct gctgcaggga gagcggcaag 120
agcaacgtgg gcacttctgg agaccacgac gactctgcta tgaagacact caggagcaag 180



atgggcaagt ggtgccgcca ctgcttcccc tgctgcaggg ggagtggcaa gagcaacgtg 240
ggcgcttctg gagaccacga cgactctgct atgaagacac tcaggaacaa gatgggcaag 300
tggtgctgcc actgcttccc ctgctgcagg gggagcggca agagcaaggt gggcgcttgg 360
ggagactacg atgacagtgc cttcatggag cccaggtacc acgtccgtgg agaagatctg 420
gacaagctcc acagagctgc ctggtggggt aaagtcccca gaaaggatct catcgtcatg 480
ctcagggaca ctgacgtgaa caagaaggac aagcaaaaga ggactgctct acatctggcc 540
tctgccaatg ggaattcaga agtagtaaaa ctcctgctgg acagacgatg tcaacttaat 600
gtccttgaca acaaaaagag gacagctctg ataaaggccg tacaatgcca ggaagatgaa 660
tgtgcgttaa tgttgctgga acatggcact gatccaaata ttccagatga gtatggaaat 720
accactctgc actacgctat ctataatgaa gataaattaa tggccaaagc actgctctta 780
tatggtgctg atatcgaatc aaaaaacaag catggcctca caccactgtt acttggtgta 840
catgagcaaa aacagcaagt cgtgaaattt ttaatcaaga aaaaagcgaa tttaaatgca 900
ctggatagat atggaaggac tgctctcata cttgctgtat gttgtggatc agcaagtata 960
gtcagccttc tacttgagca aaatattgat gtatcttctc aagatctatc tggacagacg 1020
gccagagagt atgctgtttc tagtcatcat catgtaattt gccagttact ttctgactac 1080
aaagaaaaac agatgctaaa aatctcttct gaaaacagca atccagaaca agacttaaag 1140
ctgacatcag aggaagagtc acaaaggttc aaaggcagtg aaaatagcca gccagagaaa 1200
atgtctcaag aaccagaaat aaataaggat ggtgatagag aggttgaaga agaaatgaag 1260
aagcatgaaa gtaataatgt gggattacta gaaaacctga ctaatggtgt cactgctggc 1320
aatggtgata atggattaat tcctcaaagg aagagcagaa cacctgaaaa tcagcaattt 1380
cctgacaacg aaagtgaaga gtatcacaga atttgcgaat tagtttctga ctacaaagaa 1440
aaacagatgc caaaatactc ttctgaaaac agcaacccag aacaagactt aaagctgaca 1500
tcagaggaag agtcacaaag gcttgagggc agtgaaaatg gccagccaga gaaaagatct 1560
caagaaccag aaataaataa ggatggtgat agagagctag aaaattttat ggctatcgaa 1620
gaaatgaaga agcacggaag tactcatgtc ggattcccag aaaacctgac taatggtgcc 1680
actgctggca atggtgatga tggattaatt cctccaagga agagcagaac acctgaaagc 1740
cagcaatttc ctgacactga gaatgaagag tatcacagtg acgaacaaaa tgatactcag 1800
aagcaatttt gtgaagaaca gaacactgga atattacacg atgagattct gattcatgaa 1860
gaaaagcaga tagaagtggt tgaaaaaatg aattctgagc tttctcttag ttgtaagaaa 1920
gaaaaagaca tcttgcatga aaatagtacg ttgcgggaag aaattgccat gctaagactg 1980
gagctagaca caatgaaaca tcagagccag ctaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2040



<210> 8
<211> 384
<212> PRT
<213> Homo sapien

<400> 8

Met Val Val Glu Val Asp Ser Met Pro Ala Ala Ser Ser Val Lys Lys
1 5 10 15

Pro Phe Gly Leu Arg Ser Lys Met Gly Lys Trp Cys Cys Arg Cys Phe
20 25 30

Pro Cys Cys Arg Glu Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp
35 40 45

His Asp Asp Ser Ala Met Lys Thr Leu Arg Ser Lys Met Gly Lys Trp
50 55 60

Cys Arg His Cys Phe Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val
65 70 75 80

Gly Ala Ser Gly Asp His Asp Asp Ser Ala Met Lys Thr Leu Arg Asn
85 90 95

Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro Cys Cys Arg Gly Ser
100 105 110

Gly Lys Ser Lys Val Gly Ala Trp Gly Asp Tyr Asp Asp Ser Ala Phe
115 120 125

Met Glu Pro Arg Tyr His Val Arg Gly Glu Asp Leu Asp Lys Leu His
130 135 140

Arg Ala Ala Trp Trp Gly Lys Val Pro Arg Lys Asp Leu Ile Val Met
145 150 155 160

Leu Arg Asp Thr Asp Val Asn Lys Lys Asp Lys Gln Lys Arg Thr Ala
165 170 175

Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu Val Val Lys Leu Leu
180 185 190

Leu Asp Arg Arg Cys Gln Leu Asn Val Leu Asp Asn Lys Lys Arg Thr
195 200 205

Ala Leu Ile Lys Ala Val Gln Cys Gln Glu Asp Glu Cys Ala Leu Met
210 215 220

Leu Leu Glu His Gly Thr Asp Pro Asn Ile Pro Asp Glu Tyr Gly Asn
225 230 235 240

Thr Thr Leu His Tyr Ala Ile Tyr Asn Glu Asp Lys Leu Met Ala Lys
245 250 255

Ala Leu Leu Leu Tyr Gly Ala Asp Ile Glu Ser Lys Asn Lys His Gly
260 265 270

Leu Thr Pro Leu Leu Leu Gly Val His Glu Gln Lys Gln Gln Val Val
275 280 285

Lys Phe Leu Ile Lys Lys Lys Ala Asn Leu Asn Ala Leu Asp Arg Tyr
290 295 300

Gly Arg Thr Ala Leu Ile Leu Ala Val Cys Cys Gly Ser Ala Ser Ile
305 310 315 320

Val Ser Leu Leu Leu Glu Gln Asn Ile Asp Val Ser Ser Gln Asp Leu
325 330 335

Ser Gly Gln Thr Ala Arg Glu Tyr Ala Val Ser Ser His His His Val
340 345 350

Ile Cys Gln Leu Leu Ser Asp Tyr Lys Glu Lys Gln Met Leu Lys Ile
355 360 365

Ser Ser Glu Asn Ser Asn Pro Glu Asn Val Ser Arg Thr Arg Asn Lys
370 375 380




<210> 9
<211> 656
<212> PRT
<213> Homo sapien

<400> 9

Met Val Val Glu Val Asp Ser Met Pro Ala Ala Ser Ser Val Lys Lys
1 5 10 15

Pro Phe Gly Leu Arg Ser Lys Met Gly Lys Trp Cys Cys Arg Cys Phe
20 25 30

Pro Cys Cys Arg Glu Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp
35 40 45

His Asp Asp Ser Ala Met Lys Thr Leu Arg Ser Lys Met Gly Lys Trp
50 55 60

Cys Arg His Cys Phe Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val
65 70 75 80

Gly Ala Ser Gly Asp His Asp Asp Ser Ala Met Lys Thr Leu Arg Asn
85 90 95

Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro Cys Cys Arg Gly Ser
100 105 110

Gly Lys Ser Lys Val Gly Ala Trp Gly Asp Tyr Asp Asp Ser Ala Phe
115 120 125

Met Glu Pro Arg Tyr His Val Arg Gly Glu Asp Leu Asp Lys Leu His
130 135 140

Arg Ala Ala Trp Trp Gly Lys Val Pro Arg Lys Asp Leu Ile Val Met
145 150 155 160

Leu Arg Asp Thr Asp Val Asn Lys Lys Asp Lys Gln Lys Arg Thr Ala
165 170 175

Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu Val Val Lys Leu Leu
180 185 190

Leu Asp Arg Arg Cys Gln Leu Asn Val Leu Asp Asn Lys Lys Arg Thr
195 200 205

Ala Leu Ile Lys Ala Val Gln Cys Gln Glu Asp Glu Cys Ala Leu Met
210 215 220

Leu Leu Glu His Gly Thr Asp Pro Asn Ile Pro Asp Glu Tyr Gly Asn
225 230 235 240

Thr Thr Leu His Tyr Ala Ile Tyr Asn Glu Asp Lys Leu Met Ala Lys
245 250 255

Ala Leu Leu Leu Tyr Gly Ala Asp Ile Glu Ser Lys Asn Lys His Gly
260 265 270

Leu Thr Pro Leu Leu Leu Gly Val His Glu Gln Lys Gln Gln Val Val
275 280 285

Lys Phe Leu Ile Lys Lys Lys Ala Asn Leu Asn Ala Leu Asp Arg Tyr
290 295 300

Gly Arg Thr Ala Leu Ile Leu Ala Val Cys Cys Gly Ser Ala Ser Ile
305 310 315 320

Val Ser Leu Leu Leu Glu Gln Asn Ile Asp Val Ser Ser Gln Asp Leu
325 330 335

Ser Gly Gln Thr Ala Arg Glu Tyr Ala Val Ser Ser His His His Val
340 345 350

Ile Cys Gln Leu Leu Ser Asp Tyr Lys Glu Lys Gln Met Leu Lys Ile
355 360 365

Ser Ser Glu Asn Ser Asn Pro Glu Gln Asp Leu Lys Leu Thr Ser Glu
370 375 380

Glu Glu Ser Gln Arg Phe Lys Gly Ser Glu Asn Ser Gln Pro Glu Lys
385 390 395 400

Met Ser Gln Glu Pro Glu Ile Asn Lys Asp Gly Asp Arg Glu Val Glu
405 410 415

Glu Glu Met Lys Lys His Glu Ser Asn Asn Val Gly Leu Leu Glu Asn
420 425 430

Leu Thr Asn Gly Val Thr Ala Gly Asn Gly Asp Asn Gly Leu Ile Pro
435 440 445

Gln Arg Lys Ser Arg Thr Pro Glu Asn Gln Gln Phe Pro Asp Asn Glu
450 455 460











Seir 


/"111 
u±U 


pi 11 


Tyr 


nXS 


Arg 


xxe 


Cys 


Pin 

VaXU 


Leu 


Val 


Ser 


Asp 


Tyr 


Lys 


Pi 11 
oXU 












4 T n 










475 










^ O U 


Lys 


Pin 


wet 


Pro 


Lys 


Tyr 


Ser 


Ser 


Pi 11 
uXU 


Asn 


Ser 


Asn 


Pro 


Pill 

uXU 


PI n 

uxn 


Asp 










4D □ 










490 










>l O R 

4yo 




Leu 


Lys 


Leu 


i nr 


C A 

ber 


PI 11 

bXU 


Pill 

tjXU 


P1 11 

oXU 


oer 


Gin Arg 


Leu 


Glu 


P 1 *r 

oxy 


Ser 


PI 11 
uXU 








^ n n 

o u u 










ouo 










Rl A 

oxu 






Asn 




PTn 


Pro 


PI n 

uXu 


Leu 


PT n 

uXU 


Asn 


irlie 


Met 


Ala 


He 


Glu 


PI 1-1 

uXu 


Met 


Lys 






DlO 










o n 










525 








Lys 


His 


PT i» 

tjxy 


oer 


Thr 


U -1 CI 

nlS 


vax 


PI t» 
oxy 


Phe 


Pro 


Glu 


Asn 


Leu 


xnr 


Asn 


PI IT 

tixy 




530 










535 










540 










Ala Thr Ala Gly Asn Gly Asp Asp Gly Leu Ile Pro Pro Arg Lys Ser
545                 550                 555                 560

Arg Thr Pro Glu Ser Gln Gln Phe Pro Asp Thr Glu Asn Glu Glu Tyr
                565                 570                 575

His Ser Asp Glu Gln Asn Asp Thr Gln Lys Gln Phe Cys Glu Glu Gln
            580                 585                 590

Asn Thr Gly Ile Leu His Asp Glu Ile Leu Ile His Glu Glu Lys Gln
        595                 600                 605

Ile Glu Val Val Glu Lys Met Asn Ser Glu Leu Ser Leu Ser Cys Lys
    610                 615                 620

Lys Glu Lys Asp Ile Leu His Glu Asn Ser Thr Leu Arg Glu Glu Ile
625                 630                 635                 640

Ala Met Leu Arg Leu Glu Leu Asp Thr Met Lys His Gln Ser Gln Leu
                645                 650                 655





<210> 10
<211> 671
<212> PRT
<213> Homo sapien

<400> 10

Met Val Val Glu Val Asp Ser Met Pro Ala Ala Ser Ser Val Lys Lys
  1               5                  10                  15

Pro Phe Gly Leu Arg Ser Lys Met Gly Lys Trp Cys Cys Arg Cys Phe
             20                  25                  30

Pro Cys Cys Arg Glu Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp
         35                  40                  45

His Asp Asp Ser Ala     Lys Thr Leu Arg Ser Lys Met Gly Lys Trp
     50                  55                  60

Cys Arg His Cys Phe Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val
 65                  70                  75                  80

Gly Ala Ser Gly Asp     Asp Asp Ser Ala Met Lys Thr Leu Arg Asn
                 85                  90                  95

Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro Cys Cys Arg Gly Ser
            100                 105                 110

Gly Lys Ser Lys Val     Ala Trp Gly Asp Tyr Asp Asp Ser Ala Phe
        115                 120                 125

Met Glu Pro Arg Tyr     Val Arg Gly Glu Asp Leu Asp Lys Leu His
    130                 135                 140

Arg Ala Ala Trp Trp Gly Lys Val Pro Arg Lys Asp Leu Ile Val Met
145                 150                 155                 160

Leu Arg Asp Thr Asp Val Asn Lys Lys Asp Lys Gln Lys Arg Thr Ala
                165                 170                 175

Leu His Leu Ala Ser     Asn Gly Asn Ser Glu Val Val Lys Leu Leu
            180                 185                 190

Leu Asp Arg Arg Cys     Leu Asn Val Leu Asp Asn Lys Lys Arg Thr
        195                 200                 205

Ala Leu Ile Lys Ala Val Gln Cys Gln Glu Asp Glu Cys Ala Leu Met
    210                 215                 220

Leu Leu Glu His Gly Thr Asp Pro Asn Ile Pro Asp Glu Tyr Gly Asn
225                 230                 235                 240

Thr Thr Leu His Tyr Ala Ile Tyr Asn Glu Asp Lys Leu Met Ala Lys
                245                 250                 255

Ala Leu Leu Leu Tyr Gly Ala Asp Ile Glu Ser Lys Asn Lys His Gly
            260                 265                 270

Leu Thr Pro Leu Leu Leu Gly Val His Glu Gln Lys Gln Gln Val Val
        275                 280                 285

Lys Phe Leu Ile Lys Lys Lys Ala Asn Leu Asn Ala Leu Asp Arg Tyr
    290                 295                 300

Gly Arg Thr Ala Leu Ile Leu Ala Val Cys Cys Gly Ser Ala Ser Ile
305                 310                 315                 320

Val Ser Leu Leu Leu Glu Gln Asn Ile Asp Val Ser Ser Gln Asp Leu
                325                 330                 335

Ser Gly Gln Thr Ala Arg Glu Tyr Ala Val Ser Ser His His His Val
            340                 345                 350

Ile Cys Gln Leu Leu Ser Asp Tyr Lys Glu Lys Gln Met Leu Lys Ile
        355                 360                 365

Ser Ser Glu Asn Ser Asn Pro Glu Gln Asp Leu Lys Leu Thr Ser Glu
    370                 375                 380

Glu Glu Ser Gln Arg Phe Lys Gly Ser Glu Asn Ser Gln Pro Glu Lys
385                 390                 395                 400

Met Ser     Glu Pro Glu Ile Asn Lys Asp Gly Asp Arg Glu Val Glu
                405                 410                 415

Glu Glu Met Lys Lys His Glu Ser Asn Asn Val Gly Leu Leu Glu Asn
            420                 425                 430

Leu Thr Asn Gly Val Thr Ala Gly Asn Gly Asp Asn Gly Leu Ile Pro
        435                 440                 445

    Arg Lys Ser Arg Thr Pro Glu Asn Gln Gln Phe Pro Asp Asn Glu
    450                 455                 460

Ser Glu Glu Tyr His Arg Ile Cys Glu Leu Val Ser Asp Tyr Lys Glu
465                 470                 475                 480

Lys Gln Met Pro Lys Tyr Ser Ser Glu Asn Ser Asn Pro Glu Gln Asp
                485                 490                 495

Leu Lys Leu Thr Ser Glu Glu Glu Ser Gln Arg Leu Glu Gly Ser Glu
            500                 505                 510

Asn Gly Gln Pro Glu Lys Arg Ser Gln Glu Pro Glu Ile Asn Lys Asp
        515                 520                 525

Gly Asp Arg Glu Leu Glu Asn Phe Met Ala Ile Glu Glu Met Lys Lys
    530                 535                 540

His Gly Ser Thr His Val Gly Phe Pro Glu Asn Leu Thr Asn Gly Ala
545                 550                 555                 560

Thr Ala Gly Asn Gly Asp Asp Gly Leu Ile Pro Pro Arg Lys Ser Arg
                565                 570                 575

Thr Pro Glu Ser Gln Gln Phe Pro Asp Thr Glu Asn Glu Glu Tyr His
            580                 585                 590

Ser Asp Glu Gln Asn Asp Thr Gln Lys Gln Phe Cys Glu Glu Gln Asn
        595                 600                 605

Thr Gly Ile Leu His Asp Glu Ile Leu Ile His Glu Glu Lys Gln Ile
    610                 615                 620

Glu Val Val Glu Lys Met Asn Ser Glu Leu Ser Leu Ser Cys Lys Lys
625                 630                 635                 640

Glu Lys Asp Ile Leu His Glu Asn Ser Thr Leu Arg Glu Glu Ile Ala
                645                 650                 655

Met Leu Arg Leu Glu Leu Asp Thr Met Lys His Gln Ser Gln Leu
            660                 665                 670





<210> 11 

<211> 800 
<212> DNA
<213> Homo sapien 

<400> 11 

atkagcttcc gcttctgaca acactagaga tccctcccct ccctcagggt atggccctcc 60 

acttcatttt tggtacataa catctttata ggacaggggt aaaatcccaa tactaacagg 120 

agaatgctta ggactctaac aggtttttga gaatgtgttg gtaagggcca ctcaatccaa 180 

tttttcttgg tcctccttgt ggtctaggag gacaggcaag ggtgcagatt ttcaagaatg 240 

catcagtaag ggccactaaa tccgaccttc ctcgttcctc cttgtggtct gggaggaaaa 300 

ctagtgtttc tgttgctgtg tcagtgagca caactattcc gatcagcagg gtccagggac 360 

cactgcaggt tcttgggcag ggggagaaac aaaacaaacc aaaaccatgg gcrgttttgt 420 

ctttcagatg ggaaacactc aggcatcaac aggctcacct ttgaaatgca tcctaagcca 480 

atgggacaaa tttgacccac aaaccctgga aaaagaggtg gctcattttt tttgcactat 540 

ggcttggccc caacattctc tctctgatgg ggaaaaatgg ccacctgagg gaagtacaga 600 

ttacaatact atcctgcagc ttgacctttt ctgtaagagg gaaggcaaat ggagtgaaat 660 

accttatgtc caagctttct tttcattgaa ggagaataca ctatgcaaag cttgaaattt 720 

acatcccaca ggaggacctc tcagcttacc cccatatcct agcctcccta tagctcccct 780

tcctattagt gataagcctc 800 



<210> 12
<211> 102
<212> PRT
<213> Homo sapien

<220>
<221> VARIANT
<222> (1)...(102)
<223> Xaa = Any Amino Acid

<400> 12

Met Gly Xaa Phe Val Phe Gln Met Gly Asn Thr Gln Ala Ser Thr Gly
  1               5                  10                  15

Ser Pro Leu Lys Cys Ile Leu Ser Gln Trp Asp Lys Phe Asp Pro Gln
             20                  25                  30

Thr Leu Glu Lys Glu Val Ala His Phe Phe Cys Thr Met Ala Trp Pro
         35                  40                  45

Gln His Ser Leu Ser Asp Gly Glu Lys Trp Pro Pro Glu Gly Ser Thr
     50                  55                  60

Asp Tyr Asn Thr Ile Leu Gln Leu Asp Leu Phe Cys Lys Arg Glu Gly
 65                  70                  75                  80

Lys Trp Ser Glu Ile Pro Tyr Val Gln Ala Phe Phe Ser Leu Lys Glu
                 85                  90                  95

Asn Thr Leu Cys Lys Ala
            100



















<210> 13 

<211> 1206 

<212> DNA 

<213> Homo sapien 

<400> 13 

ggcacgagga agttttgtgt actgaaaaag aaactgtcag aagcaaaaga aataaaatca 60 

cagttagaga accaaaaagt taaatgggaa caagagctct gcagtgtgag gtttctcaca 120 

ctcatgaaaa tgaaaattat ctcttacatg aaaattgcat gttgaaaaag gaaattgcca 180 

tgctaaaact ggaaatagcc acactgaaac accaatacca ggaaaaggaa aataaatact 240 

ttgaggacat taagatttta aaagaaaaga atgctgaact tcagatgacc ctaaaactga 300 

aagaggaatc attaactaaa agggcatctc aatatagtgg gcagcttaaa gttctgatag 360 

ctgagaacac aatgctcact tctaaattga aggaaaaaca agacaaagaa atactagagg 420 

cagaaattga atcacaccat cctagactgg cttctgctgt acaagaccat gatcaaattg 480 

tgacatcaag aaaaagtcaa gaacctgctt tccacattgc aggagatgct tgtttgcaaa 540 

gaaaaatgaa tgttgatgtg agtagtacga tatataacaa tgaggtgctc catcaaccac 600 

tttctgaagc tcaaaggaaa tccaaaagcc taaaaattaa tctcaattat gccggagatg 660 

ctctaagaga aaatacattg gtttcagaac atgcacaaag agaccaacgt gaaacacagt 720 

gtcaaatgaa ggaagctgaa cacatgtatc aaaacgaaca agataatgtg aacaaacaca 780 

ctgaacagca ggagtctcta gatcagaaat tatttcaact acaaagcaaa aatatgtggc 840 

ttcaacagca attagttcat gcacataaga aagctgacaa caaaagcaag ataacaattg 900 

atattcattt tcttgagagg aaaatgcaac atcatctcct aaaagagaaa aatgaggaga 960 

tatttaatta caataaccat ttaaaaaacc gtatatatca atatgaaaaa gagaaagcag 1020 

aaacagaagt tatataatag tataacactg ccaaggagcg gattatctca tcttcatcct 1080 

gtaattccag tgtttgtcac gtggttgttg aataaatgaa taaagaatga gaaaaccaga 1140 

agctctgata cataatcata atgataatta tttcaatgca caactacggg tggtgctgct 1200 

cgtgcc 1206 

<210> 14 

<211> 317 

<212> PRT 

<213> Homo sapien 



<400> 14

Met Gly Thr Arg Ala Leu Gln Cys Glu Val Ser His Thr His Glu Asn
  1               5                  10                  15

Glu Asn Tyr Leu Leu His Glu Asn Cys Met Leu Lys Lys Glu Ile Ala
             20                  25                  30

Met Leu Lys Leu Glu Ile Ala Thr Leu Lys His Gln Tyr Gln Glu Lys
         35                  40                  45

Glu Asn Lys Tyr Phe Glu Asp Ile Lys Ile Leu Lys Glu Lys Asn Ala
     50                  55                  60

Glu Leu Gln Met Thr Leu Lys Leu Lys Glu Glu Ser Leu Thr Lys Arg
 65                  70                  75                  80

Ala Ser Gln Tyr Ser Gly Gln Leu Lys Val Leu Ile Ala Glu Asn Thr
                 85                  90                  95

Met Leu Thr Ser Lys Leu Lys Glu Lys Gln Asp Lys Glu Ile Leu Glu
            100                 105                 110

Ala Glu Ile Glu Ser His His Pro Arg Leu Ala Ser Ala Val Gln Asp
        115                 120                 125

His Asp Gln Ile Val Thr Ser Arg Lys Ser Gln Glu Pro Ala Phe His
    130                 135                 140

Ile Ala Gly Asp Ala Cys Leu Gln Arg Lys Met Asn Val Asp Val Ser
145                 150                 155                 160

Ser Thr Ile Tyr Asn Asn Glu Val Leu His Gln Pro Leu Ser Glu Ala
                165                 170                 175

Gln Arg Lys Ser Lys Ser Leu Lys Ile Asn Leu Asn Tyr Ala Gly Asp
            180                 185                 190

Ala Leu Arg Glu Asn Thr Leu Val Ser Glu His Ala Gln Arg Asp Gln
        195                 200                 205

Arg Glu Thr Gln Cys Gln Met Lys Glu Ala Glu His Met Tyr Gln Asn
    210                 215                 220

Glu Gln Asp Asn Val Asn Lys His Thr Glu Gln Gln Glu Ser Leu Asp
225                 230                 235                 240

Gln Lys Leu Phe Gln Leu Gln Ser Lys Asn Met Trp Leu Gln Gln Gln
                245                 250                 255

Leu Val His Ala His Lys Lys Ala Asp Asn Lys Ser Lys Ile Thr Ile
            260                 265                 270

Asp Ile His Phe Leu Glu Arg Lys Met Gln His His Leu Leu Lys Glu
        275                 280                 285

Lys Asn Glu Glu Ile Phe Asn Tyr Asn Asn His Leu Lys Asn Arg Ile
    290                 295                 300

Tyr Gln Tyr Glu Lys Glu Lys Ala Glu Thr Glu Val Ile
305                 310                 315












<210> 15

<211> 1665

<212> DNA 

<213> Homo sapien 



<400> 15 

gcaaactttc aagcagagcc tcccgagaag ccatctgcct tcgagcctgc cattgaaatg 60 

caaaagtctg ttccaaataa agccttggaa ttgaagaatg aacaaacatt gagagcagat 120 

cagatgttcc cttcagaatc aaaacaaaag aaggttgaag aaaattcttg ggattctgag 180 

agtctccgtg agactgtttc acagaaggat gtgtgtgtac ccaaggctac acatcaaaaa 240 

gaaatggata aaataagtgg aaaattagaa gattcaacta gcctatcaaa aatcttggat 300 

acagttcatt cttgtgaaag agcaagggaa cttcaaaaag atcactgtga acaacgtaca 360 

ggaaaaatgg aacaaatgaa aaagaagttt tgtgtactga aaaagaaact gtcagaagca 420 

aaagaaataa aatcacagtt agagaaccaa aaagttaaat gggaacaaga gctctgcagt 480

gtgaggtttc tcacactcat gaaaatgaaa attatctctt acatgaaaat tgcatgttga 540 

aaaaggaaat tgccatgcta aaactggaaa tagccacact gaaacaccaa taccaggaaa 600 

aggaaaataa atactttgag gacattaaga ttttaaaaga aaagaatgct gaacttcaga 660

tgaccctaaa actgaaagag gaatcattaa ctaaaagggc atctcaatat agtgggcagc 720

ttaaagttct gatagctgag aacacaatgc tcacttctaa attgaaggaa aaacaagaca 780 

aagaaatact agaggcagaa attgaatcac accatcctag actggcttct gctgtacaag 840 

accatgatca aattgtgaca tcaagaaaaa gtcaagaacc tgctttccac attgcaggag 900 

atgcttgttt gcaaagaaaa atgaatgttg atgtgagtag tacgatatat aacaatgagg 960 

tgctccatca accactttct gaagctcaaa ggaaatccaa aagcctaaaa attaatctca 1020 

attatgccgg agatgctcta agagaaaata cattggtttc agaacatgca caaagagacc 1080 

aacgtgaaac acagtgtcaa atgaaggaag ctgaacacat gtatcaaaac gaacaagata 1140 

atgtgaacaa acacactgaa cagcaggagt ctctagatca gaaattattt caactacaaa 1200 

gcaaaaatat gtggcttcaa cagcaattag ttcatgcaca taagaaagct gacaacaaaa 1260 

gcaagataac aattgatatt cattttcttg agaggaaaat gcaacatcat ctcctaaaag 1320 

agaaaaatga ggagatattt aattacaata accatttaaa aaaccgtata tatcaatatg 1380 

aaaaagagaa agcagaaaca gaaaactcat gagagacaag cagtaagaaa cttcttttgg 1440 

agaaacaaca gaccagatct ttactcacaa ctcatgctag gaggccagtc ctagcattac 1500 

cttatgttga aaatcttacc aatagtctgt gtcaacagaa tacttatttt agaagaaaaa 1560 

ttcatgattt cttcctgaag cctgggcgac agagcgagac tctgtctcaa aaaaaaaaaa 1620 

aaaaaaagaa agaaagaaat gcctgtgctt acttcgcttc ccagg 1665 

<210> 16 

<211> 179 

<212> PRT 

<213> Homo sapien 



<400> 16

Ala Asn Phe Gln Ala Glu Pro Pro Glu Lys Pro Ser Ala Phe Glu Pro
  1               5                  10                  15

Ala Ile Glu Met Gln Lys Ser Val Pro Asn Lys Ala Leu Glu Leu Lys
             20                  25                  30

Asn Glu Gln Thr Leu Arg Ala Asp Gln Met Phe Pro Ser Glu Ser Lys
         35                  40                  45

Gln Lys Lys Val Glu Glu Asn Ser Trp Asp Ser Glu Ser Leu Arg Glu
     50                  55                  60

Thr Val Ser Gln Lys Asp Val Cys Val Pro Lys Ala Thr His Gln Lys
 65                  70                  75                  80

Glu Met Asp Lys Ile Ser Gly Lys Leu Glu Asp Ser Thr Ser Leu Ser
                 85                  90                  95

Lys Ile Leu Asp Thr Val His Ser Cys Glu Arg Ala Arg Glu Leu Gln
            100                 105                 110

Lys Asp His Cys Glu Gln Arg Thr Gly Lys Met Glu Gln Met Lys Lys
        115                 120                 125

Lys Phe Cys Val Leu Lys Lys Lys Leu Ser Glu Ala Lys Glu Ile Lys
    130                 135                 140

Ser Gln Leu Glu Asn Gln Lys Val Lys Trp Glu Gln Glu Leu Cys Ser
145                 150                 155                 160

Val Arg Phe Leu Thr Leu Met Lys Met Lys Ile Ile Ser Tyr Met Lys
                165                 170                 175

Ile Ala Cys



<210> 17 
<211> 1681 
<212> DNA 
<213> Homo sapien 

<400> 17 

gatacagtca ttcttgtgaa agagcaaggg aacttcaaaa agatcactgt gaacaacgta 60 

caggaaaaat ggaacaaatg aaaaagaagt tttgtgtact gaaaaagaaa ctgtcagaag 120 

caaaagaaat aaaatcacag ttagagaacc aaaaagttaa atgggaacaa gagctctgca 180

gtgtgagatt gactttaaac caagaagaag agaagagaag aaatgccgat atattaaatg 240

aaaaaattag ggaagaatta ggaagaatcg aagagcagca taggaaagag ttagaagtga 300

aacaacaact tgaacaggct ctcagaatac aagatataga attgaagagt gtagaaagta 360

atttgaatca ggtttctcac actcatgaaa atgaaaatta tctcttacat gaaaattgca 420

tgttgaaaaa ggaaattgcc atgctaaaac tggaaatagc cacactgaaa caccaatacc 480

aggaaaagga aaataaatac tttgaggaca ttaagatttt aaaagaaaag aatgctgaac 540

ttcagatgac cctaaaactg aaagaggaat cattaactaa aagggcatct caatatagtg 600

ggcagcttaa agttctgata gctgagaaca caatgctcac ttctaaattg aaggaaaaac 660

aagacaaaga aatactagag gcagaaattg aatcacacca tcctagactg gcttctgctg 720

tacaagacca tgatcaaatt gtgacatcaa gaaaaagtca agaacctgct ttccacattg 780

caggagatgc ttgtttgcaa agaaaaatga atgttgatgt gagtagtacg atatataaca 840

atgaggtgct ccatcaacca ctttctgaag ctcaaaggaa atccaaaagc ctaaaaatta 900

atctcaatta tgccggagat gctctaagag aaaatacatt ggtttcagaa catgcacaaa 960

gagaccaacg tgaaacacag tgtcaaatga aggaagctga acacatgtat caaaacgaac 1020

aagataatgt gaacaaacac actgaacagc aggagtctct agatcagaaa ttatttcaac 1080

tacaaagcaa aaatatgtgg cttcaacagc aattagttca tgcacataag aaagctgaca 1140

acaaaagcaa gataacaatt gatattcatt ttcttgagag gaaaatgcaa catcatctcc 1200

taaaagagaa aaatgaggag atatttaatt acaataacca tttaaaaaac cgtatatatc 1260

aatatgaaaa agagaaagca gaaacagaaa actcatgaga gacaagcagt aagaaacttc 1320

ttttggagaa acaacagacc agatctttac tcacaactca tgctaggagg ccagtcctag 1380

cattacctta tgttgaaaaa tcttaccaat agtctgtgtc aacagaatac ttattttaga 1440

agaaaaattc atgatttctt cctgaagcct acagacataa aataacagtg tgaagaatta 1500

cttgttcacg aattgcataa aagctgccca ggatttccat ctaccctgga tgatgccgga 1560

gacatcattc aatccaacca gaatctcgct ctgtcactca ggctggagtg cagtgggcgc 1620

aatctcggct cactgcaact ctgcctccca ggttcacgcc attctctggc acagcctccc 1680

g 1681





<210> 18
<211> 432
<212> PRT
<213> Homo sapien

<400> 18

Asp Thr Val His Ser Cys Glu Arg Ala Arg Glu Leu Gln Lys Asp His
  1               5                  10                  15

Cys Glu Gln Arg Thr Gly Lys Met Glu Gln Met Lys Lys Lys Phe Cys
             20                  25                  30

Val Leu Lys Lys Lys Leu Ser Glu Ala Lys Glu Ile Lys Ser Gln Leu
         35                  40                  45

Glu Asn Gln Lys Val Lys Trp Glu Gln Glu Leu Cys Ser Val Arg Leu
     50                  55                  60

Thr Leu Asn Gln Glu Glu Glu Lys Arg Arg Asn Ala Asp Ile Leu Asn
 65                  70                  75                  80

Glu Lys Ile Arg Glu Glu Leu Gly Arg Ile Glu Glu Gln His Arg Lys
                 85                  90                  95

Glu Leu Glu Val Lys Gln Gln Leu Glu Gln Ala Leu Arg Ile Gln Asp
            100                 105                 110

Ile Glu Leu Lys Ser Val Glu Ser Asn Leu Asn Gln Val Ser His Thr
        115                 120                 125

His Glu Asn Glu Asn Tyr Leu Leu His Glu Asn Cys Met Leu Lys Lys
    130                 135                 140

Glu Ile Ala Met Leu Lys Leu Glu Ile Ala Thr Leu Lys His Gln Tyr
145                 150                 155                 160

Gln Glu Lys Glu Asn Lys Tyr Phe Glu Asp Ile Lys Ile Leu Lys Glu
                165                 170                 175

Lys Asn Ala Glu Leu Gln Met Thr Leu Lys Leu Lys Glu Glu Ser Leu
            180                 185                 190

Thr Lys Arg Ala Ser Gln Tyr Ser Gly Gln Leu Lys Val Leu Ile Ala
        195                 200                 205

Glu Asn Thr Met Leu Thr Ser Lys Leu Lys Glu Lys Gln Asp Lys Glu
    210                 215                 220

Ile Leu Glu Ala Glu Ile Glu Ser His His Pro Arg Leu Ala Ser Ala
225                 230                 235                 240

Val Gln Asp His Asp Gln Ile Val Thr Ser Arg Lys Ser Gln Glu Pro
                245                 250                 255

Ala Phe His Ile Ala Gly Asp Ala Cys Leu Gln Arg Lys Met Asn Val
            260                 265                 270

Asp Val Ser Ser Thr Ile Tyr Asn Asn Glu Val Leu His Gln Pro Leu
        275                 280                 285

Ser Glu Ala Gln Arg Lys Ser Lys Ser Leu Lys Ile Asn Leu Asn Tyr
    290                 295                 300

Ala Gly Asp Ala Leu Arg Glu Asn Thr Leu Val Ser Glu His Ala Gln
305                 310                 315                 320

Arg Asp Gln Arg Glu Thr Gln Cys Gln Met Lys Glu Ala Glu His Met
                325                 330                 335

Tyr Gln Asn Glu Gln Asp Asn Val Asn Lys His Thr Glu Gln Gln Glu
            340                 345                 350

Ser Leu Asp Gln Lys Leu Phe Gln Leu Gln Ser Lys Asn Met Trp Leu
        355                 360                 365

Gln Gln Gln Leu Val His Ala His Lys Lys Ala Asp Asn Lys Ser Lys
    370                 375                 380

Ile Thr Ile Asp Ile His Phe Leu Glu Arg Lys Met Gln His His Leu
385                 390                 395                 400

Leu Lys Glu Lys Asn Glu Glu Ile Phe Asn Tyr Asn Asn His Leu Lys
                405                 410                 415

Asn Arg Ile Tyr Gln Tyr Glu Lys Glu Lys Ala Glu Thr Glu Asn Ser
            420                 425                 430







<210> 19 

<211> 3681

<212> DNA

<213> Homo sapiens 

<400> 19 

tccgagctga ttacagacac caaggaagat gctgtaaaga gtcagcagcc acagccctgg 60

ctagctggcc ctgtgggcat ttattagtaa agttttaatg acaaaagctt tgagtcaaca 120

cacccgtggg taattaacct ggtcatcccc accctggaga gccatcctgc ccatgggtga 180

tcaaagaagg aacatctgca ggaacacctg atgaggctgc acccttggcg gaaagaacac 240

ctgacacagc tgaaagcttg gtggaaaaaa cacctgatga ggctgcaccc ttggtggaaa 300

gaacacctga cacggctgaa agcttggtgg aaaaaacacc tgatgaggct gcatccttgg 360

tggagggaac atctgacaaa attcaatgtt tggagaaagc gacatctgga aagttcgaac 420

agtcagcaga agaaacacct agggaaatta cgagtcctgc aaaagaaaca tctgagaaat 480

ttacgtggcc agcaaaagga agacctagga agatcgcatg ggagaaaaaa gaagacacac 540

ctagggaaat tatgagtccc gcaaaagaaa catctgagaa atttacgtgg gcagcaaaag 600

gaagacctag gaagatcgca tgggagaaaa aagaaacacc tgtaaagact ggatgcgtgg 660

caagagtaac atctaataaa actaaagttt tggaaaaagg aagatctaag atgattgcat 720

gtcctacaaa agaatcatct acaaaagcaa gtgccaatga tcagaggttc ccatcagaat 780

ccaaacaaga ggaagatgaa gaatattctt gtgattctcg gagtctcttt gagagttctg 840

caaagattca agtgtgtata cctgagtcta tatatcaaaa agtaatggag ataaatagag 900

aagtagaaga gcctcctaag aagccatctg ccttcaagcc tgccattgaa atgcaaaact 960

ctgttccaaa taaagccttt gaattgaaga atgaacaaac attgagagca gatccgatgt 1020

tcccaccaga atccaaacaa aaggactatg aagaaaattc ttgggattct gagagtctct 1080

gtgagactgt ttcacagaag gatgtgtgtt tacccaaggc tacacatcaa aaagaaatag 1140

ataaaataaa tggaaaatta gaagagtctc ctaataaaga tggtcttctg aaggctacct 1200

gcggaatgaa agtttctatt ccaactaaag ccttagaatt gaaggacatg caaactttca 1260

aagcagagcc tccggggaag ccatctgcct tcgagcctgc cactgaaatg caaaagtctg 1320

tcccaaataa agccttggaa ttgaaaaatg aacaaacatt gagagcagat gagatactcc 1380

catcagaatc caaacaaaag gactatgaag aaagttcttg ggattctgag agtctctgtg 1440 

agactgtttc acagaaggat gtgtgtttac ccaaggctrc rcatcaaaaa gaaatagata 1500 

aaataaatgg aaaattagaa gggtctcctg ttaaagatgg tcttctgaag gctaactgcg 1560 

gaatgaaagt ttctattcca actaaagcct tagaattgat ggacatgcaa actttcaaag 1620 

cagagcctcc cgagaagcca tctgccttcg agcctgccat tgaaatgcaa aagtctgttc 1680 

caaataaagc cttggaattg aagaatgaac aaacattgag agcagatgag atactcccat 1740 

cagaatccaa acaaaaggac tatgaagaaa gttcttggga ttctgagagt ctctgtgaga 1800 

ctgtttcaca gaaggatgtg tgtttaccca aggctrcrca tcaaaaagaa atagataaaa 1860 

taaatggaaa attagaagag tctcctgata atgatggttt tctgaaggct ccctgcagaa 1920 

tgaaagtttc tattccaact aaagccttag aattgatgga catgcaaact ttcaaagcag 1980 

agcctcccga gaagccatct gccttcgagc ctgccattga aatgcaaaag tctgttccaa 2040 

ataaagcctt ggaattgaag aatgaacaaa cattgagagc agatcagatg ttcccttcag 2100 

aatcaaaaca aaagaasgtt gaagaaaatt cttgggattc tgagagtctc cgtgagactg 2160 

tttcacagaa ggatgtgtgt gtacccaagg ctacacatca aaaagaaatg gataaaataa 2220 

gtggaaaatt agaagattca actagcctat caaaaatctt ggatacagtt cattcttgtg 2280 

aaagagcaag ggaacttcaa aaagatcact gtgaacaacg tacaggaaaa atggaacaaa 2340 

tgaaaaagaa gttttgtgta ctgaaaaaga aactgtcaga agcaaaagaa ataaaatcac 2400 

agttagagaa ccaaaaagtt aaatgggaac aagagctctg cagtgtgagg tttctcacac 2460 

tcatgaaaat gaaaattatc tcttacatga aaattgcatg ttgaaaaagg aaattgccat 2520 

gctaaaactg gaaatagcca cactgaaaca ccaataccag gaaaaggaaa ataaatactt 2580 

tgaggacatt aagattttaa aagaaaagaa tgctgaactt cagatgaccc taaaactgaa 2640 

agaggaatca ttaactaaaa gggcatctca atatagtggg cagcttaaag ttctgatagc 2700 

tgagaacaca atgctcactt ctaaattgaa ggaaaaacaa gacaaagaaa tactagaggc 2760

agaaattgaa tcacaccatc ctagactggc ttctgctgta caagaccatg atcaaattgt 2820

gacatcaaga aaaagtcaag aacctgcttt ccacattgca ggagatgctt gtttgcaaag 2880

aaaaatgaat gttgatgtga gtagtacgat atataacaat gaggtgctcc atcaaccact 2940

ttctgaagct caaaggaaat ccaaaagcct aaaaattaat ctcaattatg cmggagatgc 3000 

tctaagagaa aatacattgg tttcagaaca tgcacaaaga gaccaacgtg aaacacagtg 3060 

tcaaatgaag gaagctgaac acatgtatca aaacgaacaa gataatgtga acaaacacac 3120 

tgaacagcag gagtctctag atcagaaatt atttcaacta caaagcaaaa atatgtggct 3180 

tcaacagcaa ttagttcatg cacataagaa agctgacaac aaaagcaaga taacaattga 3240 

tattcatttt cttgagagga aaatgcaaca tcatctccta aaagagaaaa atgaggagat 3300 

atttaattac aataaccatt taaaaaaccg tatatatcaa tatgaaaaag agaaagcaga 3360 

aacagaaaac tcatgagaga caagcagtaa gaaacttctt ttggagaaac aacagaccag 3420 

atctttactc acaactcatg ctaggaggcc agtcctagca tcaccttatg ttgaaaatct 3480 

taccaatagt ctgtgtcaac agaatactta ttttagaaga aaaattcatg atttcttcct 3540 

gaagcctaca gacataaaat aacagtgtga agaattactt gttcacgaat tgcataaagc 3600 

tgcacaggat tcccatctac cctgatgatg cagcagacat cattcaatcc aaccagaatc 3660 
tcgctctgtc actcaggctg g 3681

<210> 20 

<211> 1424 

<212> DNA 

<213> Homo sapiens 

<400> 20 

tccgagctga ttacagacac caaggaagat gctgtaaaga gtcagcagcc acagccctgg 60 

ctagctggcc ctgtgggcat ttattagtaa agttttaatg acaaaagctt tgagtcaaca 120 

cacccgtggg taattaacct ggtcatcccc accctggaga gccatcctgc ccatgggtga 180 

tcaaagaagg aacatctgca ggaacacctg atgaggctgc acccttggcg gaaagaacac 240 

ctgacacagc tgaaagcttg gtggaaaaaa cacctgatga ggctgcaccc ttggtggaaa 300 

gaacacctga cacggctgaa agcttggtgg aaaaaacacc tgatgaggct gcatccttgg 360 

tggagggaac atctgacaaa attcaatgtt tggagaaagc gacatctgga aagttcgaac 420 

agtcagcaga agaaacacct agggaaatta cgagtcctgc aaaagaaaca tctgagaaat 480 

ttacgtggcc agcaaaagga agacctagga agatcgcatg ggagaaaaaa gaagacacac 540 

ctagggaaat tatgagtccc gcaaaagaaa catctgagaa atttacgtgg gcagcaaaag 600 

gaagacctag gaagatcgca tgggagaaaa aagaaacacc tgtaaagact ggatgcgtgg 660 

caagagtaac atctaataaa actaaagttt tggaaaaagg aagatctaag atgattgcat 720 

gtcctacaaa agaatcatct acaaaagcaa gtgccaatga tcagaggttc ccatcagaat 780 

ccaaacaaga ggaagatgaa gaatattctt gtgattctcg gagtctcttt gagagttctg 840 

caaagattca agtgtgtata cctgagtcta tatatcaaaa agtaatggag ataaatagag 900 

aagtagaaga gcctcctaag aagccatctg ccttcaagcc tgccattgaa atgcaaaact 960 

ctgttccaaa taaagccttt gaattgaaga atgaacaaac attgagagca gatccgatgt 1020 

tcccaccaga atccaaacaa aaggactatg aagaaaattc ttgggattct gagagtctct 1080 

gtgagactgt ttcacagaag gatgtgtgtt tacccaaggc tacacatcaa aaagaaatag 1140 

ataaaataaa tggaaaatta gaaggtaaga accgtttttt atttaaaaat cagttgaccg 1200 

aatatttctc taaactgatg aggagggata tcctctagta gctgaagaaa attacctcct 1260 

aaatgcaaac catggaaaaa aagagaagtg caatggtcgt aagttgtatg tctcatcagg 1320 

tgttggcaac agactatatt gagagtgctg aaaaggagct gaattattag tttgaattca 1380 

agatattgca agacctgaga gaaaaaaaaa aaaaaaaaaa aaaa 1424 



<210> 21 

<211> 674 

<212> DNA 

<213> Homo sapiens 



<400> 21 

attccgagct gattacagac accaaggaag atgctgtaaa gagtcagcag ccacagccct 60 

ggctagctgg ccctgtgggc atttattagt aaagttttaa tgacaaaagc tttgagtcaa 120 

cacacccgtg ggtaattaac ctggtcatcc ccaccctgga gagccatcct gcccatgggt 180 

gatcaaagaa ggaacatctg caggaacacc tgatgaggct gcacccttgg cggaaagaac 240 

acctgacaca gctgaaagct tggtggaaaa aacacctgat gaggctgcac ccttggtgga 300 

aagaacacct gacacggctg aaagcttggt ggaaaaaaca cctgatgagg ctgcatcctt 360 

ggtggaggga acatctgaca aaattcaatg tttggagaaa gcgacatctg gaaagttcga 420 

acagtcagca gaagaaacac ctagggaaat tacgagtcct gcaaaagaaa catctgagaa 480 

atttacgtgg ccagcaaaag gaagacctag gaagatcgca tgggagaaaa aagatgactc 540 

agttaaggca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 600 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 660 

aaaaaaaaaa aaaa 674 



<210> 22 

<211> 1729 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 

<222> (11) 

<223> n=A,T,C or G 

<221> unsure 

<222> (1128) 

<223> n=A,T,C or G 



<400> 22 

gaaagttcga ncagtcagca gaagaaacac ctagggaaat tacgagtcct gcaaaagaaa 60 
catctgagaa atttacgtgg ccagcaaaag gaagacctag gaagatcgca tgggagaaaa 120 
aagaagacac acctagggaa attatgagtc ccgcaaaaga aacatctgag aaatttacgt 180 
gggcagcaaa aggaagacct aggaagatcg catgggagaa aaaagaaaca cctgtaaaga 240 
ctggatgcgt ggcaagagta acatctaata aaactaaagt tttggaaaaa ggaagatcta 300 
agatgattgc atgtcctaca aaagaatcat ctacaaaagc aagtgccaat gatcagaggt 360 
tcccatcaga atccaaacaa gaggaagatg aagaatattc ttgtgattct cggagtctct 420 
ttgagagttc tgcaaagatt caagtgtgta tacctgagtc tatatatcaa aaagtaatgg 480 
agataaatag agaagtagaa gagcctccta agaagccatc tgccttcaag cctgccattg 540 
aaatgcaaaa ctctgttcca aataaagcct ttgaattgaa gaatgaacaa acattgagag 600 
cagatccgat gttcccacca gaatccaaac aaaaggacta tgaagaaaat tcttgggatt 660 
ctgagagtct ctgtgagact gtttcacaga aggatgtgtg tttacccaag gctacacatc 720 
aaaaagaaat agataaaata aatggaaaat tagaagagtc tcctaataaa gatggtcttc 780 
tgaaggctac ctgcggaatg aaagtttcta ttccaactaa agccttagaa ttgaaggaca 840 
tgcaaacttt caaagcagag cctccgggga agccatctgc cttcgagcct gccactgaaa 900 
tgcaaaagtc tgtcccaaat aaagccttgg aattgaaaaa tgaacaaaca ttgagagcag 960 
atgagatact cccatcagaa tccaaacaaa aggactatga agaaaattct tgggatactg 1020 
agagtctctg tgagactgtt tcacagaagg atgtgtgttt acccaaggct gcgcatcaaa 1080 
aagaaataga taaaataaat ggaaaattag aagggtctcc tggtaaanat ggtcttctga 1140 
aggctaactg cggaatgaaa gtttctattc caactaaagc cttagaattg atggacatgc 1200 
aaactttcaa agcagagcct cccgagaagc catctgcctt cgagcctgcc attgaaatgc 1260 
aaaagtctgt tccaaataaa gccttggaat tgaagaatga acaaacattg agagcagatg 1320 
agatactccc atcagaatcc aaacaaaagg actatgaaga aagttcttgg gattctgaga 1380 
gtctctgtga gactgtttca cagaaggatg tgtgtttacc caaggctgcg catcaaaaag 1440 
aaatagataa aataaatgga aaattagaag gtaagaaccg ttttttattt aaaaatcatt 1500 
tgaccaaata tttctctaaa ttgatgagga aggatatcct ctagtagctg aagaaaatta 1560 
cctcctaaat gcaaaccatg gaaaaaaaga gaagtgcaat ggtcataagc tatgtgtctc 1620 
atcaggcatt ggcaacagac tatattgtga gtgctgaaga ggagctgaat tactagttta 1680 
aattcaagat attccaagac gtgaggaaaa tgagaaaaaa aaaaaaaaa 1729 

<210> 23 

<211> 1337 

<212> DNA 

<213> Homo sapiens 

<400> 23 

aaaaagaaat agataaaata aatggaaaat tagaagggtc tcctgttaaa gatggtcttc 60 
tgaaggctaa ctgcggaatg aaagtttcta ttccaactaa agccttagaa ttgatggaca 120 
tgcaaacttt caaagcagag cctcccgaga agccatctgc cttcgagcct gccattgaaa 180 
tgcaaaagtc tgttccaaat aaagccttgg aattgaagaa tgaacaaaca ttgagagcag 240 
atgagatact cccatcagaa tccaaacaaa aggactatga agaaagttct tgggattctg 300 
agagtctctg tgagactgtt tcacagaagg atgtgtgttt acccaaggct gcgcatcaaa 360 
aagaaataga taaaataaat ggaaaattag aagagtctcc tgataatgat ggttttctga 420 
aggctccctg cagaatgaaa gtttctattc caactaaagc cttagaattg atggacatgc 480 
aaactttcaa agcagagcct cccgagaagc catctgcctt cgagcctgcc attgaaatgc 540 
aaaagtctgt tccaaataaa gccttggaat tgaagaatga acaaacattg agagcagatc 600 
agatgttccc ttcagaatca aaacaaaaga aggttgaaga aaattcttgg gattctgaga 660 
gtctccgtga gactgtttca cagaaggatg tgtgtgtacc caaggctaca catcaaaaag 720 
aaatggataa aataagtgga aaattagaag attcaactag cctatcaaaa atcttggata 780 
cagttcattc ttgtgaaaga gcaagggaac ttcaaaaaga tcactgtgaa caacgtacag 840 
gaaaaatgga acaaatgaaa aagaagtttt gtgtactgaa aaagaaactg tcagaagcaa 900 
aagaaataaa atcacagtta gagaaccaaa aagttaaatg ggaacaagag ctctgcagtg 960 
tgagattgac tttaaaccaa gaagaagaga agagaagaaa tgccgatata ttaaatgaaa 1020 
aaattaggga agaattagga agaatcgaag agcagcatag gaaagagtta gaagtgaaac 1080 
aacaacttga acaggctctc agaatacaag atatagaatt gaagagtgta gaaagtaatt 1140 
tgaatcaggt ttctcacact catgaaaatg aaaattatct cttacatgaa aattgcatgt 1200 
tgaaaaagga aattgccatg ctaaaactgg aaatagccac actgaaacac caataccagg 1260 
aaaaggaaaa taaatacttt gaggacatta agattttaaa agaaaagaat gctgaacttc 1320 
agatgacccc tcgtgcc 1337 

<210> 24 

<211> 2307 

<212> DNA 

<213> Homo sapiens 

<400> 24 

attgagagca gatgagatac tcccatcaga atccaaacaa aaggactatg aagaaagttc 60 

ttgggattct gagagtctct gtgagactgt ttcacagaag gatgtgtgtt tacccaaggc 120 

tacacatcaa aaagaaatag ataaaataaa tggaaaatta gaagggtctc ctgttaaaga 180 

tggtcttctg aaggctaact gcggaatgaa agtttctatt ccaactaaag ccttagaatt 240 
gatggacatg caaactttca aagcagagcc tcccgagaag ccatctgcct tcgagcctgc 300 
cattgaaatg caaaagtctg ttccaaataa agccttggaa ttgaagaatg aacaaacatt 360 
gagagcagat gagatactcc catcagaatc caaacaaaag gactatgaag aaagttcttg 420 
ggattctgag agtctctgtg agactgtttc acagaaggat gtgtgtttac ccaaggctac 480 
acatcaaaaa gaaatagata aaataaatgg aaaattagaa gagtctcctg ataatgatgg 540 
ttttctgaag tctccctgca gaatgaaagt ttctattcca actaaagcct tagaattgat 600 
ggacatgcaa actttcaaag cagagcctcc cgagaagcca tctgccttcg agcctgccat 660 
tgaaatgcaa aagtctgttc caaataaagc cttggaattg aagaatgaac aaacattgag 720 
agcagatcag atgttccctt cagaatcaaa acaaaagaac gttgaagaaa attcttggga 780 
ttctgagagt ctccgtgaga ctgtttcaca gaaggatgtg tgtgtaccca aggctacaca 840 
tcaaaaagaa atggataaaa taagtggaaa attagaagat tcaactagcc tatcaaaaat 900 
cttggataca gttcattctt gtgaaagagc aagggaactt caaaaagatc actgtgaaca 960 
acgtacagga aaaatggaac aaatgaaaaa gaagttttgt gtactgaaaa agaaactgtc 1020 
agaagcaaaa gaaataaaat cacagttaga gaaccaaaaa gttaaatggg aacaagagct 1080 
ctgcagtgtg aggtttctca cactcatgaa aatgaaaatt atctcttaca tgaaaattgc 1140 
atgttgaaaa aggaaattgc catgctaaaa ctggaaatag ccacactgaa acaccaatac 1200 
caggaaaagg aaaataaata ctttgaggac attaagattt taaaagaaaa gaatgctgaa 1260 
cttcagatga ccctaaaact gaaagaggaa tcattaacta aaagggcatc tcaatatagt 1320 
gggcagctta aagttctgat agctgagaac acaatgctca cttctaaatt gaaggaaaaa 1380 
caagacaaag aaatactaga ggcagaaatt gaatcacacc atcctagact ggcttctgct 1440 
gtacaagacc atgatcaaat tgtgacatca agaaaaagtc aagaacctgc tttccacatt 1500 
gcaggagatg cttgtttgca aagaaaaatg aatgttgatg tgagtagtac gatatataac 1560 
aatgaggtgc tccatcaacc actttctgaa gctcaaagga aatccaaaag cctaaaaatt 1620 
aatctcaatt atgcaggaga tgctctaaga gaaaatacat tggtttcaga acatgcacaa 1680 
agagaccaac gtgaaacaca gtgtcaaatg aaggaagctg aacacatgta tcaaaacgaa 1740 
caagataatg tgaacaaaca cactgaacag caggagtctc tagatcagaa attatttcaa 1800 
ctacaaagca aaaatatgtg gcttcaacag caattagttc atgcacataa gaaagctgac 1860 
aacaaaagca agataacaat tgatattcat tttcttgaga ggaaaatgca acatcatctc 1920 
ctaaaagaga aaaatgagga gatatttaat tacaataacc atttaaaaaa ccgtatatat 1980 
caatatgaaa aagagaaagc agaaacagaa aactcatgag agacaagcag taagaaactt 2040 
cttttggaga aacaacagac cagatcttta ctcacaactc atgctaggag gccagtccta 2100 
gcatcacctt atgttgaaaa tcttaccaat agtctgtgtc aacagaatac ttattttaga 2160 
agaaaaattc atgatttctt cctgaagcct acagacataa aataacagtg tgaagaatta 2220 
cttgttcacg aattgcataa agctgcacag gattcccatc taccctgatg atgcagcaga 2280 
catcattcaa tccaaccaga atctcgc 2307 

<210> 25 
<211> 650 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> (310) 

<223> Xaa = Any Amino Acid 
<221> unsure 
<222> (429) 

<223> Xaa = Any Amino Acid 
<221> unsure 
<222> (522) 

<223> Xaa = Any Amino Acid 
<400> 25 

Met Ser Pro Ala Lys Glu Thr Ser Glu Lys Phe Thr Trp Ala Ala Lys 
5 10 15 

Gly Arg Pro Arg Lys Ile Ala Trp Glu Lys Lys Glu Thr Pro Val Lys 
20 25 30 

Thr Gly Cys Val Ala Arg Val Thr Ser Asn Lys Thr Lys Val Leu Glu 
35 40 45 

Lys Gly Arg Ser Lys Met Ile Ala Cys Pro Thr Lys Glu Ser Ser Thr 
50 55 60 

Lys Ala Ser Ala Asn Asp Gln Arg Phe Pro Ser Glu Ser Lys Gln Glu 
65 70 75 80 

Glu Asp Glu Glu Tyr Ser Cys Asp Ser Arg Ser Leu Phe Glu Ser Ser 
85 90 95 

Ala Lys Ile Gln Val Cys Ile Pro Glu Ser Ile Tyr Gln Lys Val Met 
100 105 110 

Glu Ile Asn Arg Glu Val Glu Glu Pro Pro Lys Lys Pro Ser Ala Phe 
115 120 125 

Lys Pro Ala Ile Glu Met Gln Asn Ser Val Pro Asn Lys Ala Phe Glu 
130 135 140 

Leu Lys Asn Glu Gln Thr Leu Arg Ala Asp Pro Met Phe Pro Pro Glu 
145 150 155 160 

Ser Lys Gln Lys Asp Tyr Glu Glu Asn Ser Trp Asp Ser Glu Ser Leu 
165 170 175 

Cys Glu Thr Val Ser Gln Lys Asp Val Cys Leu Pro Lys Ala Thr His 
180 185 190 

Gln Lys Glu Ile Asp Lys Ile Asn Gly Lys Leu Glu Glu Ser Pro Asn 
195 200 205 

Lys Asp Gly Leu Leu Lys Ala Thr Cys Gly Met Lys Val Ser Ile Pro 
210 215 220 

Thr Lys Ala Leu Glu Leu Lys Asp Met Gln Thr Phe Lys Ala Glu Pro 
225 230 235 240 



Pro Gly Lys Pro Ser Ala Phe Glu Pro Ala Thr Glu Met Gln Lys Ser 
245 250 255 

Val Pro Asn Lys Ala Leu Glu Leu Lys Asn Glu Gln Thr Leu Arg Ala 
260 265 270 

Asp Glu Ile Leu Pro Ser Glu Ser Lys Gln Lys Asp Tyr Glu Glu Ser 
275 280 285 

Ser Trp Asp Ser Glu Ser Leu Cys Glu Thr Val Ser Gln Lys Asp Val 
290 295 300 

Cys Leu Pro Lys Ala Xaa His Gln Lys Glu Ile Asp Lys Ile Asn Gly 
305 310 315 320 



Lys Leu Glu Gly Ser Pro Val Lys Asp Gly Leu Leu Lys Ala Asn Cys 
325 330 335 






Gly Met Lys Val Ser Ile Pro Thr Lys Ala Leu Glu Leu Met Asp Met 
340 345 350 

Gln Thr Phe Lys Ala Glu Pro Pro Glu Lys Pro Ser Ala Phe Glu Pro 
355 360 365 

Ala Ile Glu Met Gln Lys Ser Val Pro Asn Lys Ala Leu Glu Leu Lys 
370 375 380 

Asn Glu Gln Thr Leu Arg Ala Asp Glu Ile Leu Pro Ser Glu Ser Lys 
385 390 395 400 

Gln Lys Asp Tyr Glu Glu Ser Ser Trp Asp Ser Glu Ser Leu Cys Glu 
405 410 415 

Thr Val Ser Gln Lys Asp Val Cys Leu Pro Lys Ala Xaa His Gln Lys 
420 425 430 

Glu Ile Asp Lys Ile Asn Gly Lys Leu Glu Glu Ser Pro Asp Asn Asp 
435 440 445 

Gly Phe Leu Lys Ala Pro Cys Arg Met Lys Val Ser Ile Pro Thr Lys 
450 455 460 

Ala Leu Glu Leu Met Asp Met Gln Thr Phe Lys Ala Glu Pro Pro Glu 
465 470 475 480 

Lys Pro Ser Ala Phe Glu Pro Ala Ile Glu Met Gln Lys Ser Val Pro 
485 490 495 

Asn Lys Ala Leu Glu Leu Lys Asn Glu Gln Thr Leu Arg Ala Asp Gln 
500 505 510 

Met Phe Pro Ser Glu Ser Lys Gln Lys Xaa Val Glu Glu Asn Ser Trp 
515 520 525 

Asp Ser Glu Ser Leu Arg Glu Thr Val Ser Gln Lys Asp Val Cys Val 
530 535 540 

Pro Lys Ala Thr His Gln Lys Glu Met Asp Lys Ile Ser Gly Lys Leu 
545 550 555 560 

Glu Asp Ser Thr Ser Leu Ser Lys Ile Leu Asp Thr Val His Ser Cys 
565 570 575 

Glu Arg Ala Arg Glu Leu Gln Lys Asp His Cys Glu Gln Arg Thr Gly 
580 585 590 

Lys Met Glu Gln Met Lys Lys Lys Phe Cys Val Leu Lys Lys Lys Leu 
595 600 605 

Ser Glu Ala Lys Glu Ile Lys Ser Gln Leu Glu Asn Gln Lys Val Lys 
610 615 620 

Trp Glu Gln Glu Leu Cys Ser Val Arg Phe Leu Thr Leu Met Lys Met 
625 630 635 640 

Lys Ile Ile Ser Tyr Met Lys Ile Ala Cys 
645 650 



<210> 26 
<211> 228 
<212> PRT 

<213> Homo sapiens 
<400> 26 

Met Ser Pro Ala Lys Glu Thr Ser Glu Lys Phe Thr Trp Ala Ala Lys 
5 10 15 

Gly Arg Pro Arg Lys Ile Ala Trp Glu Lys Lys Glu Thr Pro Val Lys 
20 25 30 

Thr Gly Cys Val Ala Arg Val Thr Ser Asn Lys Thr Lys Val Leu Glu 
35 40 45 

Lys Gly Arg Ser Lys Met Ile Ala Cys Pro Thr Lys Glu Ser Ser Thr 
50 55 60 

Lys Ala Ser Ala Asn Asp Gln Arg Phe Pro Ser Glu Ser Lys Gln Glu 
65 70 75 80 

Glu Asp Glu Glu Tyr Ser Cys Asp Ser Arg Ser Leu Phe Glu Ser Ser 
85 90 95 

Ala Lys Ile Gln Val Cys Ile Pro Glu Ser Ile Tyr Gln Lys Val Met 
100 105 110 

Glu Ile Asn Arg Glu Val Glu Glu Pro Pro Lys Lys Pro Ser Ala Phe 
115 120 125 

Lys Pro Ala Ile Glu Met Gln Asn Ser Val Pro Asn Lys Ala Phe Glu 
130 135 140 

Leu Lys Asn Glu Gln Thr Leu Arg Ala Asp Pro Met Phe Pro Pro Glu 
145 150 155 160 

Ser Lys Gln Lys Asp Tyr Glu Glu Asn Ser Trp Asp Ser Glu Ser Leu 
165 170 175 

Cys Glu Thr Val Ser Gln Lys Asp Val Cys Leu Pro Lys Ala Thr His 
180 185 190 

Gln Lys Glu Ile Asp Lys Ile Asn Gly Lys Leu Glu Gly Lys Asn Arg 
195 200 205 

Phe Leu Phe Lys Asn Gln Leu Thr Glu Tyr Phe Ser Lys Leu Met Arg 
210 215 220 

Arg Asp Ile Leu 
225 



<210> 27 
<211> 154 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> (148) 

<223> Xaa = Any Amino Acid 
<400> 27 

Met Arg Leu His Pro Trp Arg Lys Glu His Leu Thr Gln Leu Lys Ala 
5 10 15 

Trp Trp Lys Lys His Leu Met Arg Leu His Pro Trp Trp Lys Glu His 
20 25 30 

Leu Thr Arg Leu Lys Ala Trp Trp Lys Lys His Leu Met Arg Leu His 
35 40 45 

Pro Trp Trp Arg Glu His Leu Thr Lys Phe Asn Val Trp Arg Lys Arg 
50 55 60 

His Leu Glu Ser Ser Asn Ser Gln Gln Lys Lys His Leu Gly Lys Leu 
65 70 75 80 

Arg Val Leu Gln Lys Lys His Leu Arg Asn Leu Arg Gly Gln Gln Lys 
85 90 95 

Glu Asp Leu Gly Arg Ser His Gly Arg Lys Lys Met Thr Gln Leu Arg 
100 105 110 

Gln Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys 
115 120 125 

Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys 
130 135 140 

Lys Lys Lys Xaa Lys Lys Lys Lys Lys Lys 
145 150 



<210> 28 
<211> 466 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> (329) 

<223> Xaa = Any Amino Acid 
<400> 28 

Met Ser Pro Ala Lys Glu Thr Ser Glu Lys Phe Thr Trp Ala Ala Lys 
5 10 15 

Gly Arg Pro Arg Lys Ile Ala Trp Glu Lys Lys Glu Thr Pro Val Lys 
20 25 30 

Thr Gly Cys Val Ala Arg Val Thr Ser Asn Lys Thr Lys Val Leu Glu 
35 40 45 

Lys Gly Arg Ser Lys Met Ile Ala Cys Pro Thr Lys Glu Ser Ser Thr 
50 55 60 

Lys Ala Ser Ala Asn Asp Gln Arg Phe Pro Ser Glu Ser Lys Gln Glu 
65 70 75 80 

Glu Asp Glu Glu Tyr Ser Cys Asp Ser Arg Ser Leu Phe Glu Ser Ser 
85 90 95 

Ala Lys Ile Gln Val Cys Ile Pro Glu Ser Ile Tyr Gln Lys Val Met 
100 105 110 

Glu Ile Asn Arg Glu Val Glu Glu Pro Pro Lys Lys Pro Ser Ala Phe 
115 120 125 

Lys Pro Ala Ile Glu Met Gln Asn Ser Val Pro Asn Lys Ala Phe Glu 
130 135 140 

Leu Lys Asn Glu Gln Thr Leu Arg Ala Asp Pro Met Phe Pro Pro Glu 
145 150 155 160 

Ser Lys Gln Lys Asp Tyr Glu Glu Asn Ser Trp Asp Ser Glu Ser Leu 
165 170 175 

Cys Glu Thr Val Ser Gln Lys Asp Val Cys Leu Pro Lys Ala Thr His 
180 185 190 

Gln Lys Glu Ile Asp Lys Ile Asn Gly Lys Leu Glu Glu Ser Pro Asn 
195 200 205 

Lys Asp Gly Leu Leu Lys Ala Thr Cys Gly Met Lys Val Ser Ile Pro 
210 215 220 

Thr Lys Ala Leu Glu Leu Lys Asp Met Gln Thr Phe Lys Ala Glu Pro 
225 230 235 240 

Pro Gly Lys Pro Ser Ala Phe Glu Pro Ala Thr Glu Met Gln Lys Ser 
245 250 255 

Val Pro Asn Lys Ala Leu Glu Leu Lys Asn Glu Gln Thr Leu Arg Ala 
260 265 270 

Asp Glu Ile Leu Pro Ser Glu Ser Lys Gln Lys Asp Tyr Glu Glu Asn 
275 280 285 

Ser Trp Asp Thr Glu Ser Leu Cys Glu Thr Val Ser Gln Lys Asp Val 
290 295 300 

Cys Leu Pro Lys Ala Ala His Gln Lys Glu Ile Asp Lys Ile Asn Gly 
305 310 315 320 

Lys Leu Glu Gly Ser Pro Gly Lys Xaa Gly Leu Leu Lys Ala Asn Cys 
325 330 335 

Gly Met Lys Val Ser Ile Pro Thr Lys Ala Leu Glu Leu Met Asp Met 
340 345 350 

Gln Thr Phe Lys Ala Glu Pro Pro Glu Lys Pro Ser Ala Phe Glu Pro 
355 360 365 

Ala Ile Glu Met Gln Lys Ser Val Pro Asn Lys Ala Leu Glu Leu Lys 
370 375 380 

Asn Glu Gln Thr Leu Arg Ala Asp Glu Ile Leu Pro Ser Glu Ser Lys 
385 390 395 400 

Gln Lys Asp Tyr Glu Glu Ser Ser Trp Asp Ser Glu Ser Leu Cys Glu 
405 410 415 

Thr Val Ser Gln Lys Asp Val Cys Leu Pro Lys Ala Ala His Gln Lys 
420 425 430 

Glu Ile Asp Lys Ile Asn Gly Lys Leu Glu Gly Lys Asn Arg Phe Leu 
435 440 445 

Phe Lys Asn His Leu Thr Lys Tyr Phe Ser Lys Leu Met Arg Lys Asp 
450 455 460 

Ile Leu 
465 



<210> 29 
<211> 445 
<212> PRT 

<213> Homo sapiens 
<400> 29 

Lys Glu Ile Asp Lys Ile Asn Gly Lys Leu Glu Gly Ser Pro Val Lys 
5 10 15 

Asp Gly Leu Leu Lys Ala Asn Cys Gly Met Lys Val Ser Ile Pro Thr 
20 25 30 

Lys Ala Leu Glu Leu Met Asp Met Gln Thr Phe Lys Ala Glu Pro Pro 
35 40 45 

Glu Lys Pro Ser Ala Phe Glu Pro Ala Ile Glu Met Gln Lys Ser Val 
50 55 60 

Pro Asn Lys Ala Leu Glu Leu Lys Asn Glu Gln Thr Leu Arg Ala Asp 
65 70 75 80 

Glu Ile Leu Pro Ser Glu Ser Lys Gln Lys Asp Tyr Glu Glu Ser Ser 
85 90 95 

Trp Asp Ser Glu Ser Leu Cys Glu Thr Val Ser Gln Lys Asp Val Cys 
100 105 110 

Leu Pro Lys Ala Ala His Gln Lys Glu Ile Asp Lys Ile Asn Gly Lys 
115 120 125 

Leu Glu Glu Ser Pro Asp Asn Asp Gly Phe Leu Lys Ala Pro Cys Arg 
130 135 140 

Met Lys Val Ser Ile Pro Thr Lys Ala Leu Glu Leu Met Asp Met Gln 
145 150 155 160 

Thr Phe Lys Ala Glu Pro Pro Glu Lys Pro Ser Ala Phe Glu Pro Ala 
165 170 175 

Ile Glu Met Gln Lys Ser Val Pro Asn Lys Ala Leu Glu Leu Lys Asn 
180 185 190 

Glu Gln Thr Leu Arg Ala Asp Gln Met Phe Pro Ser Glu Ser Lys Gln 
195 200 205 

Lys Lys Val Glu Glu Asn Ser Trp Asp Ser Glu Ser Leu Arg Glu Thr 
210 215 220 

Val Ser Gln Lys Asp Val Cys Val Pro Lys Ala Thr His Gln Lys Glu 
225 230 235 240 

Met Asp Lys Ile Ser Gly Lys Leu Glu Asp Ser Thr Ser Leu Ser Lys 
245 250 255 

Ile Leu Asp Thr Val His Ser Cys Glu Arg Ala Arg Glu Leu Gln Lys 
260 265 270 

Asp His Cys Glu Gln Arg Thr Gly Lys Met Glu Gln Met Lys Lys Lys 
275 280 285 

Phe Cys Val Leu Lys Lys Lys Leu Ser Glu Ala Lys Glu Ile Lys Ser 
290 295 300 

Gln Leu Glu Asn Gln Lys Val Lys Trp Glu Gln Glu Leu Cys Ser Val 
305 310 315 320 

Arg Leu Thr Leu Asn Gln Glu Glu Glu Lys Arg Arg Asn Ala Asp Ile 
325 330 335 

Leu Asn Glu Lys Ile Arg Glu Glu Leu Gly Arg Ile Glu Glu Gln His 
340 345 350 

Arg Lys Glu Leu Glu Val Lys Gln Gln Leu Glu Gln Ala Leu Arg Ile 
355 360 365 

Gln Asp Ile Glu Leu Lys Ser Val Glu Ser Asn Leu Asn Gln Val Ser 
370 375 380 

His Thr His Glu Asn Glu Asn Tyr Leu Leu His Glu Asn Cys Met Leu 
385 390 395 400 

Lys Lys Glu Ile Ala Met Leu Lys Leu Glu Ile Ala Thr Leu Lys His 
405 410 415 

Gln Tyr Gln Glu Lys Glu Asn Lys Tyr Phe Glu Asp Ile Lys Ile Leu 
420 425 430 

Lys Glu Lys Asn Ala Glu Leu Gln Met Thr Pro Arg Ala 
435 440 445 

<210> 30 

<211> 578 

<212> DNA 

<213> Human 



<400> 30 

cttgccttct cttaggcttt gaagcatttt tgtctgtgct ccctgatctt caggtcacca 60 

ccatgaagtt cttagcagtc ctggtactct tgggagtttc catctttctg gtctctgccc 120 

agaatccgac aacagctgct ccagctgaca cgtatccagc tactggtcct gctgatgatg 180 

aagcccctga tgctgaaacc actgctgctg caaccactgc gaccactgct gctcctacca 240 

ctgcaaccac cgctgcttct accactgctc gtaaagacat tccagtttta cccaaatggg 300 

ttggggatct cccgaatggt agagtgtgtc cctgagatgg aatcagcttg agtcttctgc 360 

aattggtcac aactattcat gcttcctgtg atttcatcca actacttacc ttgcctacga 420 

tatccccttt atctctaatc agtttatttt ctttcaaata aaaaataact atgagcaaca 480 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 540 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 578 



<210> 31 

<211> 90 

<212> PRT 

<213> Homo sapien 

<400> 31 

Met Lys Phe Leu Ala Val Leu Val Leu Leu Gly Val Ser Ile Phe Leu 
1 5 10 15 

Val Ser Ala Gln Asn Pro Thr Thr Ala Ala Pro Ala Asp Thr Tyr Pro 
20 25 30 

Ala Thr Gly Pro Ala Asp Asp Glu Ala Pro Asp Ala Glu Thr Thr Ala 
35 40 45 

Ala Ala Thr Thr Ala Thr Thr Ala Ala Pro Thr Thr Ala Thr Thr Ala 
50 55 60 

Ala Ser Thr Thr Ala Arg Lys Asp Ile Pro Val Leu Pro Lys Trp Val 
65 70 75 80 

Gly Asp Leu Pro Asn Gly Arg Val Cys Pro 
85 90 



<210> 32 
<211> 3101 
<212> DNA 
<213> Homo sapien 



<400> 32 

tgttggggcc tcagcctccc aagtagctgg gactacaggt gcctgccacc acgcccagct 60 

aattttttgt atatttttta gtagagacgg ggtttcaccg tggtctcaat ctcctgacct 120 

cgtgatctgc cagccttggc ctcccaaagt gtattctctt tttattatta ttattatttt 180 

tgagatggag tctgtctctg tcgcccaggc tggagtgcag tggtgcgatc tctgctcact 240 

gcaagctccg cctcctgggt tcatgccatt ctcctgcctc agcctcccga gtagctggga 300 

ctacaggccc ctgccaccac acccggctaa ttttttgtat ttttagtaga gacagggttt 360 

caccatgtta gccagggtgg tctctatctt ctgacctcgt gatccgcctg cctcagtctc 420 

tcaaagtgct gggattacag gcgtgagcca ccgcgaccag ccaactattg ctgtttattt 480 

ttaaatatat tttaaagaaa caattagatt tgttttcttt ctcattcttt tacttctact 540 

cttcatgtat gtataattat atttgtgttt tctattacct tttctccttt tactgtattg 600 

gactataata attgtgctca ctaatttctg ttcactaata ttatcagctt agataatact 660 

ttaattttta acttatatat tgagtattaa attgatcagt tttatttgta attatctatc 720 

ttccgcttgg ctgaatataa cttcttaagc ttataacttc ttgttctttc catgttattt 780 

ttttcttttt tttaatgtat tgaatttctt ctgacactca ttctagtaac ttttttctcg 840 

gtgtgcaacg taagttataa tttgtttctc agatttgaga tctgccataa gtttgaggct 900 

ttattttttt tttttatttg ctttatggca agtcggacaa cctgcatgga tttggcatca 960 

atgtagtcac ccatatctaa gagcagcact tgcttcttag catgatgagt tgtttctgga 1020 

ttgtttcttt attttactta tattcctggt agattcttat attttccctt caactctatt 1080 

cagcatttta ggaattctta ggactttctg agaattttag ctttctgtat taaatgtttt 1140 

taatgagtat tgcattttct caaaaagcac aaatatcaat agtgtacaca tgaggaaaac 1200 

tatatatata ttctgttgca gatgacagca tctcataaca aaatcctagt tacttcattt 1260 

aaaagacagc tctcctccaa tatactatga ggtaacaaaa atttgtagtg tgtaattttt 1320 

ttaatattag aaaactcatc ttacattgtg cacaaatttc tgaagtgata atacttcact 1380 

gtttttctat agaagtaact taatattggc aaaattactt atttgaattt aggttttggc 1440 

tttcatcata tacttcctca ttaacatttc cctcaatcca taaatgcaat ctcagtttga 1500 

atcttccatt taacccagaa gttaattttt aaaaccttaa taaaatttga atgtagctag 1560 

atattatttg ttggttacat attagtcaat aatttatatt acttacaatg atcagaaaat 1620 

atgatctgaa tttctgctgt cataaattca ataacgtatt ttaggcctaa acctttccat 1680 

ttcaaatcct tgggtctggt aattgaaaat aatcattatc ttttgttttc tggccaaaaa 1740 

tgctgcccat ttatttctat ccctaattag tcaaactttc taataaatgt atttaacgtt 1800 

aatgatgttt atttgcttgt tgtatactaa aaccattagt ttctataatt taaatgtcac 1860 

ctaatatgag tgaaaatgtg tcagaggctg gggaagaatg tggatggaga aagggaaggt 1920 

gttgatcaaa aagtacccaa gtttcagtta cacaggaggc atgagattga tctagtgcaa 1980 

aaaatgatga gtataataaa taataatgca ctgtatattt tgaaattgct aaaagtagat 2040 

ttaaaattga tttacataat attttacata tttataaagc acatgcaata tgttgttaca 2100 

tgtatagaat gtgcaacgat caagtcaggg tatctgtggt atccaccact ttgagcattt 2160 

atcgattcta tatgtcagga acatttcaag ttatctgttc tagcaaggaa atataaaata 2220 

cattatagtt aactatggcc tatctacagt gcaactaaac actagatttt attcctttcc 2280 

aactgtgggt ttgtattcat ttaccaccct cttttcattc cctttctcac ccacacactg 2340 

tgccgggcct caggcatata ctattctact gtctgtctct gtaaggatta tcattttagc 2400 

ttccacatat gagagaatgc atgcaaagtt tttctttcca tgtctggctt atttcactta 2460 

acaaaatgac ctccgcttcc atccatgtta tttatattac ccaatagtgt tcataaatat 2520 

atatacacac atatatacca cattgcattt gtccaattat tcattgacgg aaactggtta 2580 

atgttatatc gttgctattg tgaatagtgc tgcaataaac acgcaagtgg ggatataatt 2640 

tgaagagttt ttttgttgat gttccataca aattttaaga ttgttttgtc tatgtttgtg 2700 

aaaatggcgt tagtattttc atagagattg cattgaatct gtagattgct ttgggtaagt 2760 

atggttattt tgatggtatt aattttttca ttccatgaag atgagatgtc tttccatttg 2820 

tttgtgtcct ctacattttc tttcatcaaa gttttgttgt atttttgaag tagatgtatt 2880 

tcaccttata gatcaagtgt attccctaaa tattttattt ttgtagctat tgtagatgaa 2940 

attgccttct cgatttcttt ttcacttaat tcattattag tgtatggaaa tgttatggat 3000 

ttttatttgt tggtttttaa tcaaaaactg tattaaactt agagtttttt gtggagtttt 3060 

taagtttttc tagatataag atcatgacat ctaccaaaaa a 3101 



<210> 33 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 33 

tgcccctccg gaagct 16 

<210> 34 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PCR primer 
<400> 34 

cgtttctgaa gggacatctg atc 23 

<210> 35 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 35 

ttgcagccaa gttaggagtg aagagatgca 30 

<210> 36 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 36 

aagcctcaga gtccttccag tatg 24 



<210> 37 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 37 

ttcaaatata agtgaagaaa aaattagtag atcaa 35 

<210> 38 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 38 

aatccattgt atcttagaac cgagggattt gtttaga 37 

<210> 39 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 39 

aaagcagatg gtggttgagg tt 22 

<210> 40 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 40 

cctgagacca aatggcttct tc 22 

<210> 41 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 41 

attccatgcc ggctgcttct tctg 24 

<210> 42 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 42 

tctggttttc tcattcttta ttcatttatt 30 

<210> 43 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 43 

tgccaaggag cggattatct 20 

<210> 44 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 44 

caaccacgtg acaaacactg gaattacagg 30 

<210> 45 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 45 

actggaacgg tgaaggtgac a 21 

<210> 46 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 46 

cggccacatt gtgaactttg 20 

<210> 47 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 47 

cagtcggttg gagcgagcat ccc 23 

<210> 48 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 

<400> 48 

tgccatagat gaattgaagg aatg 24 

<210> 49 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 49 

tgtcatatat taattgcata aacacctca 29 

<210> 50 
<211> 32 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PCR primer 
<400> 50 

tcttaaccaa acggatgaaa ctctgagcaa tg 32 

<210> 51 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 51 

atcattgaaa attcaaatat aagtgaag 28 



<210> 52 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 52 

gtagttgtgc attgaaataa ttatcattat 30 



<210> 53 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 53 

caattttggt ggagaacccg 20 

<210> 54 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 54 

gctgtcggag gtatatggtg 20 

<210> 55 
<211> 28 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PCR Primer 
<400> 55 

catttcagag agtaacatgg actacaca 28 

<210> 56 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 56 

tctgataaag gccgtacaat g 21 

<210> 57 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 57 

tcacgacttg ctgtttttgc tc 22 

<210> 58 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 58 

atcaaaaaac aagcatggcc tcacaccact 30 

<210> 59 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 59 

gcaagtgcca atgatcagag g 21 

<210> 60 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 60 

atatagactc aggtatacac act 23 

<210> 61 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 61 

tcccatcaga atccaaacaa gaggaagatg 30 

<210> 62 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 62 

aatccattgt atcttagaac cgagggattt gttt 34 

<210> 63 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 63 

ccgcttctga caacactaga gatc 24 

<210> 64 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 64 

cctataaaga tgttatgtac caaaaatgaa gt 32 

<210> 65 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 65 

cccctccctc agggtatggc cc 22 



<210> 66 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 66 

ccctttctca cccacacact gt 22 

<210> 67 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 67 

tgcattctct catatgtgga agct 24 

<210> 68 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 68 

ccgggcctca ggcatatact attctactgt ctg 33 

<210> 69 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 69 

gacattccag ttttacccaa atgg 24 

<210> 70 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 70 

tgcagaagac tcaagctgat tcc 23 

<210> 71 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 71 

tctcagggac acactctacc attcggga 28 

<210> 72 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 72 

aaatataagt gaagaaaaaa attagtagat 30 

<210> 73 

<211> 503 

<212> DNA 

<213> Homo sapiens 

<400> 73 

gacagcggct tccttgatcc ttgccacccg cgactgaaca ccgacagcag cagcctcacc 60 

atgaagttgc tgatggtcct catgctggcg gccctctccc agcactgcta cgcaggctct 120 

ggctgcccct tattggagaa tgtgatttcc aagacaatca atccacaagt gtctaagact 180 

gaatacaaag aacttcttca agagttcata gacgacaatg ccactacaaa tgccatagat 240 

gaattgaagg aatgttttct taaccaaacg gatgaaactc tgagcaatgt tgaggtgttt 300 

ctgcaattaa tatatgacag cagtctttgt gatttatttt aactttctgc aagacctttg 360 

gctcacagaa ctgcagggta tggtgagaaa ccaactacgg attgctgcaa accacacctt 420 

ctctttctta tgtcttttta ctacaaacta caagacaatt gttgaaacct gctatacatg 480 

tttattttaa taaattgatg gca 503 

<210> 74 

<211> 301 

<212> DNA 

<213> Homo sapiens 

<400> 74 

cactgctacg caggctctgg ctgcccctta ttggagaatg tgatttccaa gacaatcaat 60 

ccacaagtgt ctaagactga atacaaagaa cttcttcaag agttcataga cgacaatgcc 120 

actacaaatg ccatagatga attgaaggaa tgttttctta accaaacgga tgaaactctg 180 

agcaatgttg aggtgtttat gcaattaata tatgacagca gtctttgtga tttatttggc 240 

ggccatcacc atcaccatca ctaaggtccc gagctcgaat tctgcagata tccatcacac 300 

t 301 

<210> 75 
<211> 3282 
<212> DNA 

<213> Homo sapiens 
<400> 75 

gggacagggc tgaggatgag gagaaccctg gggacccaga agaccgtgcc ttgcccggaa 60 
gtcctgcctg taggcctgaa ggacttgccc taacagagcc tcaacaacta cctggtgatt 120 
cctacttcag ccccttggtg tgagcagctt ctcaacatga actacagcct ccacttggcc 180 
ttcgtgtgtc tgagtctctt cactgagagg atgtgcatcc aggggagtca gttcaacgtc 240

gaggtcggca gaagtgacaa gctttccctg cctggctttg agaacctcac agcaggatat 300 
aacaaatttc tcaggcccaa ttttggtgga gaacccgtac agatagcgct gactctggac 360 
attgcaagta tctctagcat ttcagagagt aacatggact acacagccac catatacctc 420

cgacagcgct ggatggacca gcggctggtg tttgaaggca acaagagctt cactctggat 480 

gcccgcctcg tggagttcct ctgggtgcca gatacttaca ttgtggagtc caagaagtcc 540 

ttcctccatg aagtcactgt gggaaacagg ctcatccgcc tcttctccaa tggcacggtc 600 

ctgtatgccc tcagaatcac gacaactgtt gcatgtaaca tggatctgtc taaatacccc 660 

atggacacac agacatgcaa gttgcagctg gaaagctggg gctatgatgg aaatgatgtg 720 

gagttcacct ggctgagagg gaacgactct gtgcgtggac tggaacacct gcggcttgct 780 

cagtacacca tagagcggta tttcacctta gtcaccagat cgcagcagga gacaggaaat 840 

tacactagat tggtcttaca gtttgagctt cggaggaatg ttctgtattt cattttggaa 900 

acctacgttc cttccacttt cctggtggtg ttgtcctggg tttcattttg gatctctctc 960 

gattcagtcc ctgcaagaac ctgcattgga gtgacgaccg tgttatcaat gaccacactg 1020 

atgatcgggt cccgcacttc tcttcccaac accaactgct tcatcaaggc catcgatgtg 1080 

tacctgggga tctgctttag ctttgtgttt ggggccttgc tagaatatgc agttgctcac 1140 

tacagttcct tacagcagat ggcagccaaa gataggggga caacaaagga agtagaagaa 1200 

gtcagtatta ctaatatcat caacagctcc atctccagct ttaaacggaa gatcagcttt 1260 

gccagcattg aaatttccag cgacaacgtt gactacagtg acttgacaat gaaaaccagc 1320 

gacaagttca agtttgtctt ccgagaaaag atgggcagga ttgttgatta tttcacaatt 1380 

caaaacccca gtaatgttga tcactattcc aaactactgt ttcctttgat ttttatgcta 1440 

gccaatgtat tttactgggc atactacatg tatttttgag tcaatgttaa atttcttgca 1500 

tgccataggt cttcaacagg acaagataat gatgtaaatg gtattttagg ccaagtgtgc 1560 

acccacatcc aatggtgcta caagtgactg aaataatatt tgagtctttc tgctcaaaga 1620 

atgaagctcc aaccattgtt ctaagctgtg tagaagtcct agcattatag gatcttgtaa 1680 

tagaaacatc agtccattcc tctttcatct taatcaagga cattcccatg gagcccaaga 1740 

ttacaaatgt actcagggct gtttattcgg tggctccctg gtttgcattt acctcatata 1800 

aagaatggga aggagaccat tgggtaaccc tcaagtgtca gaagttgttt ctaaagtaac 1860 

tatacatgtt ttttactaaa tctctgcagt gcttataaaa tacattgttg cctatttagg 1920 

gagtaacatt ttctagtttt tgtttctggt taaaatgaaa tatgggctta tgtcaattca 1980 

ttggaagtca atgcactaac tcaataccaa gatgagtttt taaataatga atattattta 2040 

ataccacaac agaattatcc ccaatttcca ataagtccta tcattgaaaa ttcaaatata 2100 

agtgaagaaa aaattagtag atcaacaatc taaacaaatc cctcggttct aagatacaat 2160 

ggattcccca tactggaagg actctgaggc tttattcccc cactatgcat atcttatcat 2220 

tttattatta tacacacatc catcctaaac tatactaaag cccttttccc atgcatggat 2280 

ggaaatggaa gatttttttg taacttgttc tagaagtctt aatatgggct gttgccatga 2340 

aggcttgcag aattgagtcc attttctagc tgcctttatt cacatagtga tggggtacta 2400 

aaagtactgg gttgactcag agagtcgctg tcattctgtc attgctgcta ctctaacact 2460 

gagcaacact ctcccagtgg cagatcccct gtatcattcc aagaggagca ttcatccctt 2520 

tgctctaatg atcaggaatg atgcttatta gaaaacaaac tgcttgaccc aggaacaagt 2580 

ggcttagctt aagtaaactt ggctttgctc agatccctga tccttccagc tggtctgctc 2640 

tgagtggctt atcccgcatg agcaggagcg tgctggccct gagtactgaa ctttctgagt 2700 

aacaatgaga cacgttacag aacctatgtt caggttgcgg gtgagctgcc ctctccaaat 2760 

ccagccagag atgcacattc ctcggccagt ctcagccaac agtaccaaaa gtgatttttg 2820 

agtgtgccag ggtaaaggct tccagttcag cctcagttat tttagacaat ctcgccatct 2880 

ttaatttctt agcttcctgt tctaataaat gcacggcttt acctttcctg tcagaaataa 2940 

accaaggctc taaaagatga tttcccttct gtaactccct agagccacag gttctcattc 3000 

cttttcccat tatacttctc acaattcagt ttctatgagt ttgatcacct gattttttta 3060 

acaaaatatt tctaacggga atgggtggga gtgctggtga aaagagatga aatgtggttg 3120 

tatgagccaa tcatatttgt gattttttaa aaaaagttta aaaggaaata tctgttctga 3180 

aaccccactt aagcattgtt tttatataaa aacaatgata aagatgtgaa ctgtgaaata 3240 

aatataccat attagctacc caccaaaaaa aaaaaaaaaa aa 3282 

<210> 76 
<211> 463 
<212> DNA 

<213> Homo sapiens 



<400> 76 

tagaattcag cggccgctta attctagaag tccaaatcac tcattgtttg tgaaagctga 60

gctcacagca aaacaagcca ccatgaagct gtcggtgtgt ctcctgctgg tcacgctggc 120

cctctgctgc taccaggcca atgccgagtt ctgcccagct cttgtttctg agctgttaga 180

cttcttcttc attagtgaac ctctgttcaa gttaagtctt gccaaatttg atgcccctcc 240

ggaagctgtt gcagccaagt taggagtgaa gagatgcacg gatcagatgt cccttcagaa 300

acgaagcctc attgcggaag tcctggtgaa aatattgaag aaatgtagtg tgtgacatgt 360

aaaaactttc atcctggttt ccactgtctt tcaatgacac cctgatcttc actgcagaat 420

gtaaaggttt caacgtcttg ctttaataaa tcacttgctc tac 463



<210> 77 
<211> 90 
<212> PRT 

<213> Homo sapiens 



<400> 77 



Met Lys Leu Ser Val Cys Leu Leu Leu Val Thr Leu Ala Leu Cys Cys
1               5                   10                  15

Tyr Gln Ala Asn Ala Glu Phe Cys Pro Ala Leu Val Ser Glu Leu Leu
            20                  25                  30

Asp Phe Phe Phe Ile Ser Glu Pro Leu Phe Lys Leu Ser Leu Ala Lys
        35                  40                  45

Phe Asp Ala Pro Pro Glu Ala Val Ala Ala Lys Leu Gly Val Lys Arg
    50                  55                  60

Cys Thr Asp Gln Met Ser Leu Gln Lys Arg Ser Leu Ile Ala Glu Val
65                  70                  75                  80

Leu Val Lys Ile Leu Lys Lys Cys Ser Val
                85                  90


















